Transgenic non-human animals capable of producing heterologous antibodies



TECHNICAL FIELD 
The invention relates to transgenic non-human
animals capable of producing heterologous antibodies,
transgenes used to produce such transgenic animals,
transgenes capable of functionally rearranging a heterologous
D gene in V-D-J recombination, immortalized B-cells capable of
producing heterologous antibodies, methods and transgenes for
producing heterologous antibodies of multiple isotypes,
methods and transgenes for inactivating or suppressing
expression of endogenous immunoglobulin loci, methods and
transgenes for producing heterologous antibodies wherein a
variable region sequence comprises somatic mutation as
compared to germline rearranged variable region sequences, and
transgenic nonhuman animals which produce antibodies having a
human primary sequence and which bind to human antigens.

BACKGROUND OF THE INVENTION 
One of the major impediments facing the development
of in vivo therapeutic and diagnostic applications for
monoclonal antibodies in humans is the intrinsic
immunogenicity of non-human immunoglobulins. For example, when
immunocompetent human patients are administered therapeutic
doses of rodent monoclonal antibodies, the patients produce
antibodies against the rodent immunoglobulin sequences; these
human anti-mouse antibodies (HAMA) neutralize the therapeutic
antibodies and can cause acute toxicity. Hence, it is
desirable to produce human immunoglobulins that are reactive
with specific human antigens that are promising therapeutic
and/or diagnostic targets. However, producing human



WO 93/12227 



PCT/US92/10983 



immunoglobulins that bind specifically with human antigens is
problematic. 

The present technology for generating monoclonal
antibodies involves pre-exposing, or priming, an animal
(usually a rat or mouse) with antigen, harvesting B-cells from
that animal, and generating a library of hybridoma clones. By
screening a hybridoma population for antigen binding
specificity (idiotype) and also screening for immunoglobulin
class (isotype), it is possible to select hybridoma clones
that secrete the desired antibody.

However, when present methods for generating
monoclonal antibodies are applied for the purpose of
generating human antibodies that have binding specificities
for human antigens, obtaining B-lymphocytes which produce
human immunoglobulins is a serious obstacle, since humans will
typically not make immune responses against self-antigens.

Hence, present methods of generating human
monoclonal antibodies that are specifically reactive with
human antigens are clearly insufficient. It is evident that
the same limitations on generating monoclonal antibodies to
authentic self antigens apply where non-human species are used
as the source of B-cells for making the hybridoma.

The construction of transgenic animals harboring a
functional heterologous immunoglobulin transgene is a method
by which antibodies reactive with self antigens may be
produced. However, in order to obtain expression of
therapeutically useful antibodies, or hybridoma clones
producing such antibodies, the transgenic animal must produce
transgenic B cells that are capable of maturing through the B
lymphocyte development pathway. Such maturation requires the
presence of surface IgM on the transgenic B cells; however,
isotypes other than IgM are desired for therapeutic uses.
Thus, there is a need for transgenes and animals harboring
such transgenes that are able to undergo functional V-D-J
rearrangement to generate recombinational diversity and
junctional diversity. Further, such transgenes and transgenic
animals preferably include cis-acting sequences that
facilitate isotype switching from a first isotype that is
required for B cell maturation to a subsequent isotype that
has superior therapeutic utility.

A number of experiments have reported the use of
transfected cell lines to determine the specific DNA sequences
required for Ig gene rearrangement (reviewed by Lewis and
Gellert (1989), Cell, 59, 585-588). Such reports have
identified putative sequences and concluded that the
accessibility of these sequences to the recombinase enzymes
used for rearrangement is modulated by transcription
(Yancopoulos and Alt (1985), Cell, 40, 271-281). The
sequences for V(D)J joining are reportedly a highly conserved,
near-palindromic heptamer and a less well conserved AT-rich
nonamer separated by a spacer of either 12 or 23 bp (Tonegawa
(1983), Nature, 302, 575-581; Hesse, et al. (1989), Genes
Dev., 2, 1053-1061). Efficient recombination reportedly
occurs only between sites containing recombination signal
sequences with different length spacer regions.
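
The spacer rule reported above can be illustrated with a short sketch. The class and function names below are hypothetical and are not part of the disclosure, and the spacer lengths assigned to the V, D, and J segments in the usage note are illustrative assumptions; the sketch models only the stated rule that efficient joining occurs between signal sequences having spacers of different length.

```python
# Illustrative sketch only: models the reported spacer-length rule for
# recombination signal sequences. All names here are hypothetical.
from dataclasses import dataclass

ALLOWED_SPACERS = (12, 23)  # spacer lengths in bp, as reported in the text


@dataclass(frozen=True)
class SignalSequence:
    """One recombination signal sequence flanking a gene segment."""
    segment: str    # label of the flanked gene segment, e.g. "V", "D", "J"
    spacer_bp: int  # length of the spacer between heptamer and nonamer

    def __post_init__(self):
        # Reject spacer lengths other than the two reported values.
        if self.spacer_bp not in ALLOWED_SPACERS:
            raise ValueError(f"spacer must be one of {ALLOWED_SPACERS}")


def can_join(a: SignalSequence, b: SignalSequence) -> bool:
    """Return True when the two signals have different-length spacers."""
    return a.spacer_bp != b.spacer_bp
```

Under the assumed spacer assignments `SignalSequence("V", 23)`, `SignalSequence("D", 12)`, and `SignalSequence("J", 23)`, `can_join` permits V-to-D and D-to-J joining but not direct V-to-J joining.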

Ig gene rearrangement, though studied in tissue
culture cells, has not been extensively examined in transgenic
mice. Only a handful of reports have been published
describing rearrangement test constructs introduced into mice
[Buchini, et al. (1987), Nature, 326, 409-411 (unrearranged
chicken λ transgene); Goodhart, et al. (1987), Proc. Natl.
Acad. Sci. USA, 84, 4229-4233 (unrearranged rabbit κ gene);
and Bruggemann, et al. (1989), Proc. Natl. Acad. Sci. USA, 86,
6709-6713 (hybrid mouse-human heavy chain)]. The results of
such experiments, however, have been variable, in some cases
producing incomplete or minimal rearrangement of the
transgene.

Further, because a variety of biological functions of
antibody molecules are exerted by the Fc portion of molecules,
such as the interaction with mast cells or basophils through
Fcε, and binding of complement by Fcμ or Fcγ, it is
desirable to generate a functional diversity of antibodies of
a given specificity by variation of isotype.

Although transgenic animals have been generated that 
incorporate transgenes encoding one or more chains of a 
heterologous antibody, there have been no reports of 
heterologous transgenes that undergo successful isotype
switching. Transgenic animals that cannot switch isotypes are 
limited to producing heterologous antibodies of a single 
isotype, and more specifically are limited to producing an 
isotype that is essential for B cell maturation, such as IgM 
and possibly IgD, which may be of limited therapeutic utility. 
Thus, there is a need for heterologous immunoglobulin 
transgenes and transgenic animals that are capable of 
switching from an isotype needed for B cell development to an 
isotype that has a desired characteristic for therapeutic use. 

Based on the foregoing, it is clear that a need 
exists for methods of efficiently producing heterologous 
antibodies, e.g. antibodies encoded by genetic sequences of a 
first species that are produced in a second species. More 
particularly, there is a need in the art for heterologous 
immunoglobulin transgenes and transgenic animals that are 
capable of undergoing functional V-D-J gene rearrangement that 
incorporates all or a portion of a D gene segment which 
contributes to recombinational diversity. Further, there is a
need in the art for transgenes and transgenic animals that can 
support V-D-J recombination and isotype switching so that (1) 
functional B cell development may occur, and (2) 
therapeutically useful heterologous antibodies may be 
produced. There is also a need for a source of B cells which 
can be used to make hybridomas that produce monoclonal 
antibodies for therapeutic or diagnostic use in the particular 
species for which they are designed. A heterologous 
immunoglobulin transgene capable of functional V-D-J 
recombination and/or capable of isotype switching could
fulfill these needs.

In accordance with the foregoing object, transgenic
nonhuman animals are provided which are capable of producing a 
heterologous antibody, such as a human antibody. 

Further, it is an object to provide B-cells from 
such transgenic animals which are capable of expressing 
heterologous antibodies wherein such B-cells are immortalized 
to provide a source of a monoclonal antibody specific for a 
particular antigen. 
In accordance with this foregoing object, it is a
further object of the invention to provide hybridoma cells 
that are capable of producing such heterologous monoclonal 
antibodies. 

Still further, it is an object herein to provide 
heterologous unrearranged and rearranged immunoglobulin heavy 
and light chain transgenes useful for producing the 
aforementioned non-human transgenic animals. 

Still further, it is an object herein to provide 
methods to disrupt endogenous immunoglobulin loci in the 
transgenic animals. 

Still further, it is an object herein to provide 
methods to induce heterologous antibody production in the 
aforementioned transgenic non-human animal. 

A further object of the invention is to provide 
methods to generate an immunoglobulin variable region gene 
segment repertoire that is used to construct one or more 
transgenes of the invention. 

The references discussed herein are provided solely 
for their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an 
admission that the inventors are not entitled to antedate such 
disclosure by virtue of prior invention. 



SUMMARY OF THE INVENTION 
Transgenic nonhuman animals are provided which are
capable of producing a heterologous antibody, such as a human
antibody. Such heterologous antibodies may be of various
isotypes, including: IgG1, IgG2, IgG3, IgG4, IgM, IgA1, IgA2,
IgAsec, IgD, or IgE. In order for such transgenic nonhuman
animals to make an immune response, it is necessary for the
transgenic B cells and pre-B cells to produce surface-bound
immunoglobulin, particularly of the IgM (or possibly IgD)
isotype, in order to effectuate B cell development and
antigen-stimulated maturation. Such expression of an IgM (or
IgD) surface-bound immunoglobulin is only required during the
antigen-stimulated maturation phase of B cell development, and
mature B cells may produce other isotypes, although only a
single switched isotype may be produced at a time.

Typically, a cell of the B-cell lineage will produce
only a single isotype at a time, although cis or trans
alternative RNA splicing, such as occurs naturally with the μs
(secreted μ) and μm (membrane-bound μ) forms, and the μ and δ
immunoglobulin chains, may lead to the contemporaneous
expression of multiple isotypes by a single cell. Therefore,
in order to produce heterologous antibodies of multiple
isotypes, specifically the therapeutically useful IgG, IgA,
and IgE isotypes, it is necessary that isotype switching
occur. Such isotype switching may be classical class-switching
or may result from one or more non-classical isotype
switching mechanisms.

The invention provides heterologous immunoglobulin
transgenes and transgenic nonhuman animals harboring such
transgenes, wherein the transgenic animal is capable of
producing heterologous antibodies of multiple isotypes by
undergoing isotype switching. Classical isotype switching
occurs by recombination events which involve at least one
switch sequence region in the transgene. Non-classical
isotype switching may occur by, for example, homologous
recombination between human σμ and human Σμ sequences
(δ-associated deletion). Alternative non-classical switching
mechanisms, such as intertransgene and/or interchromosomal
recombination, among others, may occur and effectuate isotype
switching. Such transgenes and transgenic nonhuman animals
produce a first immunoglobulin isotype that is necessary for
antigen-stimulated B cell maturation and can switch to encode
and produce one or more subsequent heterologous isotypes that
have therapeutic and/or diagnostic utility. Transgenic
nonhuman animals of the invention are thus able to produce, in
one embodiment, IgG, IgA, and/or IgE antibodies that are
encoded by human immunoglobulin genetic sequences and which
also bind specific human antigens with high affinity.

The invention also encompasses B-cells from such 
transgenic animals that are capable of expressing heterologous 
antibodies of various isotypes, wherein such B-cells are 
immortalized to provide a source of a monoclonal antibody
specific for a particular antigen. Hybridoma cells that are
derived from such B-cells can serve as one source of such
heterologous monoclonal antibodies.

The invention provides heterologous unrearranged and
rearranged immunoglobulin heavy and light chain transgenes
capable of undergoing isotype switching in vivo in the
aforementioned non-human transgenic animals or in explanted
lymphocytes of the B-cell lineage from such transgenic
animals. Such isotype switching may occur spontaneously or be
induced by treatment of the transgenic animal or explanted
B-lineage lymphocytes with agents that promote isotype
switching, such as T-cell-derived lymphokines (e.g., IL-4 and
IFNγ).

Still further, the invention includes methods to
induce heterologous antibody production in the aforementioned
transgenic non-human animal, wherein such antibodies may be of
various isotypes. These methods include producing an
antigen-stimulated immune response in a transgenic nonhuman
animal for the generation of heterologous antibodies,
particularly heterologous antibodies of a switched isotype
(i.e., IgG, IgA, and IgE).

This invention provides methods whereby the
transgene contains sequences that effectuate isotype
switching, so that the heterologous immunoglobulins produced
in the transgenic animal and monoclonal antibody clones
derived from the B-cells of said animal may be of various
isotypes.

This invention further provides methods that
facilitate isotype switching of the transgene, so that
switching between particular isotypes may occur at much higher
or lower frequencies or in different temporal orders than
typically occurs in germline immunoglobulin loci. Switch
regions may be grafted from various CH genes and ligated to
other CH genes in a transgene construct; such grafted switch
sequences will typically function independently of the
associated CH gene so that switching in the transgene
construct will typically be a function of the origin of the
associated switch regions. Alternatively, or in combination
with switch sequences, δ-associated deletion sequences may be
linked to various CH genes to effect non-classical switching
by deletion of sequences between two δ-associated deletion
sequences. Thus, a transgene may be constructed so that a
particular CH gene is linked to a different switch sequence
and thereby is switched to more frequently than occurs when
the naturally associated switch region is used.

This invention also provides methods to determine
whether isotype switching of transgene sequences has occurred
in a transgenic animal containing an immunoglobulin transgene.

The invention provides immunoglobulin transgene
constructs and methods for producing immunoglobulin transgene
constructs, some of which contain a subset of germline
immunoglobulin loci sequences (which may include deletions).
The invention includes a specific method for facilitated
cloning and construction of immunoglobulin transgenes,
involving a vector that employs unique XhoI and SalI
restriction sites flanked by two unique NotI sites. This
method exploits the complementary termini of XhoI and SalI
restriction sites and is useful for creating large constructs
by ordered concatemerization of restriction fragments in a
vector.
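
The complementary-termini property relied on above can be checked with a small sketch. The recognition sequences and cut positions used below (XhoI, C^TCGAG; SalI, G^TCGAC) are the standard published values for these enzymes rather than details taken from this specification, and the helper names are hypothetical. The sketch shows only that the two enzymes leave the same overhang and that the ligated junction is recognized by neither enzyme; it does not reproduce the vector or the cloning steps of the invention.

```python
# Illustrative sketch only: compatible cohesive ends of XhoI and SalI.
# Site sequences are standard published values; helper names are hypothetical.
XHOI_SITE = "CTCGAG"  # XhoI cuts after the first base: C^TCGAG
SALI_SITE = "GTCGAC"  # SalI cuts after the first base: G^TCGAC
CUT_OFFSET = 1        # both enzymes cut one base into a palindromic site


def overhang(site: str, cut: int = CUT_OFFSET) -> str:
    """Return the single-stranded 5' overhang left by a staggered cut."""
    return site[cut:len(site) - cut]


def ligate(left_site: str, right_site: str, cut: int = CUT_OFFSET) -> str:
    """Join the left half of one cut site to the right half of another."""
    if overhang(left_site, cut) != overhang(right_site, cut):
        raise ValueError("termini are not complementary")
    return left_site[:cut] + right_site[cut:]


# An XhoI-cut end joined to a SalI-cut end yields a hybrid junction.
junction = ligate(XHOI_SITE, SALI_SITE)
recleavable = junction in (XHOI_SITE, SALI_SITE)
```

Because the hybrid junction is cut by neither enzyme, a fragment joined in this way stays joined when the construct is digested again with XhoI or SalI, which is one reason such enzyme pairs are convenient for stepwise assembly of a large insert.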

The transgenes of the invention include a heavy
chain transgene comprising DNA encoding at least one variable
gene segment, one diversity gene segment, one joining gene
segment and one constant region gene segment. The
immunoglobulin light chain transgene comprises DNA encoding at
least one variable gene segment, one joining gene segment and
one constant region gene segment. The gene segments encoding
the light and heavy chain gene segments are heterologous to
the transgenic non-human animal in that they are derived from,
or correspond to, DNA encoding immunoglobulin heavy and light
chain gene segments from a species not consisting of the
transgenic non-human animal. In one aspect of the invention,
the transgene is constructed such that the individual gene
segments are unrearranged, i.e., not rearranged so as to
encode a functional immunoglobulin light or heavy chain. Such
unrearranged transgenes permit recombination of the gene
segments (functional rearrangement) and expression of the
resultant rearranged immunoglobulin heavy and/or light chains
within the transgenic non-human animal when said animal is
exposed to antigen.
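
The minimum segment content stated above for heavy and light chain transgenes can be expressed as a simple check. The sketch below is illustrative only; the dictionary and function names are hypothetical, and the single-letter labels V, D, J, and C stand for the variable, diversity, joining, and constant region gene segments.

```python
# Illustrative sketch only: minimum gene segment classes per transgene type,
# following the composition stated in the text. Names are hypothetical.
REQUIRED_SEGMENTS = {
    "heavy": {"V", "D", "J", "C"},  # variable, diversity, joining, constant
    "light": {"V", "J", "C"},       # light chains carry no diversity segment
}


def missing_segments(chain: str, segments) -> set:
    """Return the required segment classes absent from a transgene design."""
    return REQUIRED_SEGMENTS[chain] - set(segments)
```

For example, `missing_segments("heavy", ["V", "J", "C"])` reports that a diversity segment is absent, whereas the same segment list satisfies the light chain requirement.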

In one aspect of the invention, heterologous heavy
and light immunoglobulin transgenes comprise relatively large
fragments of unrearranged heterologous DNA. Such fragments
typically comprise a substantial portion of the C, J (and in
the case of heavy chain, D) segments from a heterologous
immunoglobulin locus. In addition, such fragments also
comprise a substantial portion of the variable gene segments.

In one embodiment, such transgene constructs
comprise regulatory sequences, e.g. promoters, enhancers,
class switch regions, recombination signals and the like,
corresponding to sequences derived from the heterologous DNA.
Alternatively, such regulatory sequences may be incorporated
into the transgene from the same or a related species of the
non-human animal used in the invention. For example, human
immunoglobulin gene segments may be combined in a transgene
with a rodent immunoglobulin enhancer sequence for use in a
transgenic mouse.

In a method of the invention, a transgenic non-human
animal containing germline unrearranged light and heavy
immunoglobulin transgenes, which undergo VDJ joining during
B-cell differentiation, is contacted with an antigen to
induce production of a heterologous antibody in a secondary
repertoire B-cell.

Also included in the invention are vectors and
methods to disrupt the endogenous immunoglobulin loci in the
non-human animal to be used in the invention. Such vectors
and methods utilize a transgene, preferably a positive-negative
selection vector, which is constructed such that it targets
the functional disruption of a class of gene segments encoding
a heavy and/or light immunoglobulin chain endogenous to the
non-human animal used in the invention. Such endogenous gene
segments include diversity, joining and constant region gene
segments. In this aspect of the invention, the
positive-negative selection vector is contacted with at least
one embryonic stem cell of a non-human animal, after which
cells are selected wherein the positive-negative selection
vector has integrated into the genome of the non-human animal
by way of homologous recombination. After transplantation,
the resultant transgenic non-human animal is substantially
incapable of mounting an immunoglobulin-mediated immune
response as a result of homologous integration of the vector
into chromosomal DNA. Such immune deficient non-human animals
may thereafter be used for study of immune deficiencies or
used as the recipient of heterologous immunoglobulin heavy and
light chain transgenes.

The invention also provides vectors, methods, and
compositions useful for suppressing the expression of one or
more species of immunoglobulin chain(s), without disrupting an
endogenous immunoglobulin locus. Such methods are useful for
suppressing expression of one or more endogenous
immunoglobulin chains while permitting the expression of one
or more transgene-encoded immunoglobulin chains. Unlike
genetic disruption of an endogenous immunoglobulin chain
locus, suppression of immunoglobulin chain expression does not
require the time-consuming breeding that is needed to
establish transgenic animals homozygous for a disrupted
endogenous Ig locus. An additional advantage of suppression
as compared to endogenous Ig gene disruption is that, in
certain embodiments, chain suppression is reversible within an
individual animal. For example, Ig chain suppression may be
accomplished with: (1) transgenes encoding and expressing
antisense RNA that specifically hybridizes to an endogenous Ig
chain gene sequence, (2) antisense oligonucleotides that
specifically hybridize to an endogenous Ig chain gene
sequence, and (3) immunoglobulins that bind specifically to an
endogenous Ig chain polypeptide.

The references discussed herein are provided solely
for their disclosure prior to the filing date of the present
application. Nothing herein is to be construed as an
admission that the inventors are not entitled to antedate such
disclosure by virtue of prior invention.

BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 depicts the complementarity determining
regions CDR1, CDR2 and CDR3 and framework regions FR1, FR2,
FR3 and FR4 in unrearranged genomic DNA and mRNA expressed
from a rearranged immunoglobulin heavy chain gene.
Fig. 2 depicts the human λ chain locus.
Fig. 3 depicts the human κ chain locus.
Fig. 4 depicts the human heavy chain locus.
Fig. 5 depicts a transgene construct containing a
rearranged IgM gene ligated to a 25 kb fragment that contains
human γ3 and γ1 constant regions followed by a 700 bp fragment
containing the rat chain 3' enhancer sequence.

Fig. 6 is a restriction map of the human κ chain
locus depicting the fragments to be used to form a light chain
transgene by way of in vivo homologous recombination.

Fig. 7 depicts the construction of pGP1.

Fig. 8 depicts the construction of the polylinker
contained in pGP1.

Fig. 9 depicts the fragments used to construct a
human heavy chain transgene of the invention.

Fig. 10 depicts the construction of pHIG1 and pCON1.

Fig. 11 depicts the human Cγ1 fragments which are
inserted into pRE3 (rat enhancer 3') to form pREG2.

Fig. 12 depicts the construction of pHIG3' and pCON.

Fig. 13 depicts the fragment containing human D
region segments used in construction of the transgenes of the
invention.

Fig. 14 depicts the construction of pHIG2 (D segment
containing plasmid).

Fig. 15 depicts the fragments covering the human Jκ
and human Cκ gene segments used in constructing a transgene of
the invention.

Fig. 16 depicts the structure of pEμ.

Fig. 17 depicts the construction of pKapH.

Figs. 18A through 18D depict the construction of a
positive-negative selection vector for functionally disrupting
the endogenous heavy chain immunoglobulin locus of mouse.

Figs. 19A through 19C depict the construction of a
positive-negative selection vector for functionally disrupting
the endogenous immunoglobulin light chain loci in mouse.

Figs. 20 a through e depict the structure of a kappa
light chain targeting vector.

Figs. 21 a through f depict the structure of a mouse
heavy chain targeting vector.

Fig. 22 depicts the map of vector pGPe.

Fig. 23 depicts the structure of vector pJM2.

Fig. 24 depicts the structure of vector pCOR1.

Fig. 25 depicts the transgene constructs for pIGM1,
pHC1 and pHC2.

Fig. 26 depicts the structure of pγe2.

Fig. 27 depicts the structure of pVGE1.

Fig. 28 depicts the assay results of human Ig
expression in a pHC1 transgenic mouse.

Fig. 29 depicts the structure of pJCK1.

Fig. 30 depicts the construction of a synthetic
heavy chain variable region.

Fig. 31 is a schematic representation of the heavy
chain minilocus constructs pIGM1, pHC1, and pHC2.

Fig. 32 is a schematic representation of the heavy
chain minilocus construct pIGG1 and the κ light chain
minilocus construct pKC1, pKVe1, and pKC2.

Fig. 33 depicts a scheme to reconstruct functionally
rearranged light chain genes.

Fig. 34 depicts serum ELISA results.

Fig. 35 depicts the results of an ELISA assay of
serum from 8 transgenic mice.

Fig. 36 is a schematic representation of plasmid
pBCE1.

Fig. 37 depicts the immune response of transgenic
mice of the present invention against KLH-DNP, by measuring
IgG and IgM levels specific for KLH-DNP (37A), KLH (37B) and
BSA-DNP (37C).

Fig. 38 shows ELISA data demonstrating the presence
of antibodies that bind human carcinoembryonic antigen (CEA)
and comprise human μ chains; each panel shows reciprocal
serial dilutions from pooled serum samples obtained from mice
on the indicated day following immunization.

Fig. 39 shows ELISA data demonstrating the presence
of antibodies that bind human carcinoembryonic antigen (CEA)
and comprise human γ chains; each panel shows reciprocal
serial dilutions from pooled serum samples obtained from mice
on the indicated day following immunization.

Fig. 40 shows aligned variable region sequences of
23 randomly-chosen cDNAs generated from mRNA obtained from
lymphoid tissue of HC1 transgenic mice immunized with human
carcinoembryonic antigen (CEA) as compared to the germline
transgene sequence (top line); on each line nucleotide changes
relative to germline sequence are shown above the alteration
in deduced amino acid sequence (if any); the regions
corresponding to heavy chain CDR1, CDR2, and CDR3 are
indicated. Non-germline encoded nucleotides are shown in
capital letters. Germline VH251 and JH are shown in lower case
letters. Deduced amino acid changes are given beneath
nucleotide sequences using the conventional single-letter
notation.

Fig. 41 shows the data from Fig. 40 in histogram
format; deduced amino acid residue position is shown as the
ordinate (left is the amino-terminal direction, right is in
the direction towards the carboxy-terminus) and frequency of
sequence variation is shown as the abscissa.

Fig. 42 shows the nucleotide sequence of a human DNA
fragment, designated vk65.3, containing a Vκ gene segment; the
deduced amino acid sequences of the Vκ coding regions are also
shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 43 shows the nucleotide sequence of a human DNA
fragment, designated vk65.5, containing a Vκ gene segment; the
deduced amino acid sequences of the Vκ coding regions are also
shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 44 shows the nucleotide sequence of a human DNA
fragment, designated vk65.8, containing a Vκ gene segment; the
deduced amino acid sequences of the Vκ coding regions are also
shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 45 shows the nucleotide sequence of a human DNA
fragment, designated vk65.15, containing a Vκ gene segment;
the deduced amino acid sequences of the Vκ coding regions are
also shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 46 shows formation of a light chain minilocus
by homologous recombination between two overlapping fragments
which were co-injected.
Table 1 depicts the sequence of vector pGPe. 

Table 2 depicts the sequence of gene VH49.8.

Table 3 depicts the detection of human IgM and IgG 
in the serum of transgenic mice of this invention. 

Table 4 depicts sequences of VDJ joints. 

Table 5 depicts the distribution of J segments
incorporated into pHC1 transgene encoded transcripts compared
with J segments found in adult human peripheral blood
lymphocytes (PBL).

Table 6 depicts the distribution of D segments
incorporated into pHC1 transgene encoded transcripts compared
with D segments found in adult human peripheral blood
lymphocytes (PBL).

Table 7 depicts the length of the CDR3 peptides from
transcripts with in-frame VDJ joints in the pHC1 transgenic
mouse and in human PBL.

Table 8 depicts the predicted amino acid sequences
of the VDJ regions from 30 clones analyzed from a pHC1
transgenic.

Table 9 shows transgenic mice of line 112 that were
used in the indicated experiments; (+) indicates the presence
of the respective transgene, (++) indicates that the animal is
homozygous for the JHD knockout transgene.

DETAILED DESCRIPTION 
As has been discussed supra, it is desirable to 
produce human immunoglobulins that are reactive with specific 
human antigens that are promising therapeutic and/or 
diagnostic targets. However, producing human immunoglobulins
that bind specifically with human antigens is problematic. 

First, the immunized animal that serves as the
source of B cells must make an immune response against the
presented antigen. In order for an animal to make an immune
response, the antigen presented must be foreign and the animal
must not be tolerant to the antigen. Thus, for example, if it
is desired to produce a human monoclonal antibody with an
idiotype that binds to a human protein, self-tolerance will
prevent an immunized human from making a substantial immune
response to the human protein, since the only epitopes of the
antigen that may be immunogenic will be those that result from
polymorphism of the protein within the human population
(allogeneic epitopes).
Second, if the animal that serves as the source of
B-cells for forming a hybridoma (a human in the illustrative
example given) does make an immune response against an
authentic self antigen, a severe autoimmune disease may result
in the animal. Where humans would be used as a source of
B-cells for a hybridoma, such autoimmunization would be
considered unethical by contemporary standards.

One methodology that can be used to obtain human
antibodies that are specifically reactive with human antigens
is the production of a transgenic mouse harboring the human
immunoglobulin transgene constructs of this invention.
Briefly, transgenes containing all or portions of the human
immunoglobulin heavy and light chain loci, or transgenes
containing synthetic "miniloci" (described infra, and in
PCT/US91/06185 filed August 28, 1991) which comprise essential
functional elements of the human heavy and light chain loci,
are employed to produce a transgenic nonhuman animal. Such a
transgenic nonhuman animal will have the capacity to produce
immunoglobulin chains that are encoded by human immunoglobulin
genes, and additionally will be capable of making an immune
response against human antigens. Thus, such transgenic animals
can serve as a source of immune sera reactive with specified
human antigens, and B-cells from such transgenic animals can
be fused with myeloma cells to produce hybridomas that secrete
monoclonal antibodies that are encoded by human immunoglobulin
genes and which are specifically reactive with human antigens.

The production of transgenic mice containing various
forms of immunoglobulin genes has been reported previously.
Rearranged mouse immunoglobulin heavy or light chain genes
have been used to produce transgenic mice. In addition,
functionally rearranged human Ig genes including the μ or γ1
constant region have been expressed in transgenic mice.
However, the results of experiments in which the transgene
comprises unrearranged (V-D-J or V-J not rearranged)
immunoglobulin genes have been variable, in some cases
producing incomplete or minimal rearrangement of the
transgene. Moreover, there are no published examples of either
rearranged or unrearranged immunoglobulin transgenes which
undergo successful isotype switching between CH genes within
a transgene.



Definitions



As used herein, the term "antibody" refers to a
glycoprotein comprising at least two light polypeptide chains
and two heavy polypeptide chains. Each of the heavy and light
polypeptide chains contains a variable region (generally the
amino terminal portion of the polypeptide chain) which
contains a binding domain which interacts with antigen. Each
of the heavy and light polypeptide chains also comprises a
constant region of the polypeptide chains (generally the
carboxyl terminal portion) which may mediate the binding of
the immunoglobulin to host tissues or factors including
various cells of the immune system, some phagocytic cells and
the first component (C1q) of the classical complement system.

As used herein, a "heterologous antibody" is defined
in relation to the transgenic non-human organism producing
such an antibody. It is defined as an antibody having an amino
acid sequence or an encoding DNA sequence corresponding to
that found in an organism not consisting of the transgenic
non-human animal.

As used herein, a "heterohybrid antibody" refers to
an antibody having light and heavy chains of different
organismal origins. For example, an antibody having a human
heavy chain associated with a murine light chain is a
heterohybrid antibody.

As used herein, "isotype" refers to the antibody
class (e.g., IgM or IgG1) that is encoded by heavy chain
constant region genes.

As used herein, "isotype switching" refers to the
phenomenon by which the class, or isotype, of an antibody
changes from one Ig class to one of the other Ig classes.

As used herein, "nonswitched isotype" refers to the
isotypic class of heavy chain that is produced when no isotype
switching has taken place; the CH gene encoding the
nonswitched isotype is typically the first CH gene immediately
downstream from the functionally rearranged VDJ gene.

As used herein, the term "switch sequence" refers to
those DNA sequences responsible for switch recombination. A
"switch donor" sequence, typically a μ switch region, will be
5' (i.e., upstream) of the construct region to be deleted
during the switch recombination. The "switch acceptor" region
will be between the construct region to be deleted and the
replacement constant region (e.g., γ, ε, etc.). As there is
no specific site where recombination always occurs, the final
gene sequence will typically not be predictable from the
construct.

As used herein, "glycosylation pattern" is defined
as the pattern of carbohydrate units that are covalently
attached to a protein, more specifically to an immunoglobulin
protein. A glycosylation pattern of a heterologous antibody
can be characterized as being substantially similar to
glycosylation patterns which occur naturally on antibodies
produced by the species of the nonhuman transgenic animal,
when one of ordinary skill in the art would recognize the
glycosylation pattern of the heterologous antibody as being
more similar to said pattern of glycosylation in the species
of the nonhuman transgenic animal than to the species from
which the CH genes of the transgene were derived.

As used herein, "specific binding" refers to the
property of the antibody: (1) to bind to a predetermined
antigen with an affinity of at least 1 x 10^7 M^-1, and (2) to
preferentially bind to the predetermined antigen with an
affinity that is at least two-fold greater than its affinity
for binding to a non-specific antigen (e.g., BSA, casein)
other than the predetermined antigen.
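
The two-part test for specific binding can be illustrated with a short sketch. It is not part of the disclosure; the function name, the parameter names, and the default threshold of 1 x 10^7 M^-1 (our reading of criterion (1)) are assumptions made for illustration, and affinities are taken to be association constants in M^-1.

```python
def is_specific_binding(ka_target, ka_nonspecific,
                        min_affinity=1e7, min_fold=2.0):
    """Apply the two criteria of the "specific binding" definition.

    ka_target      -- affinity for the predetermined antigen (M^-1)
    ka_nonspecific -- affinity for a non-specific antigen such as BSA (M^-1)
    min_affinity   -- assumed threshold for criterion (1)
    min_fold       -- minimum fold preference for criterion (2)
    """
    # Criterion (1): absolute affinity for the predetermined antigen.
    if ka_target < min_affinity:
        return False
    # Criterion (2): at least a two-fold preference over the non-specific antigen.
    return ka_target >= min_fold * ka_nonspecific


print(is_specific_binding(5e8, 1e5))   # both criteria satisfied
print(is_specific_binding(5e6, 1e3))   # fails criterion (1)
print(is_specific_binding(1e7, 8e6))   # fails criterion (2)
```

Both criteria must hold at once: a high-affinity antibody that binds the non-specific antigen nearly as well is rejected by the second test.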

The term "naturally-occurring" as used herein as
applied to an object refers to the fact that an object can be
found in nature. For example, a polypeptide or polynucleotide
sequence that is present in an organism (including viruses)
that can be isolated from a source in nature and which has not
been intentionally modified by man in the laboratory is
naturally-occurring.

The term "rearranged" as used herein refers to a
configuration of a heavy chain or light chain immunoglobulin
locus wherein a V segment is positioned immediately adjacent
to a D-J or J segment in a conformation encoding essentially a
complete VH or VL domain, respectively. A rearranged
immunoglobulin gene locus can be identified by comparison to
germline DNA; a rearranged locus will have at least one
recombined heptamer/nonamer homology element.

The term "unrearranged" or "germline configuration" 
as used herein in reference to a V segment refers to the 
configuration wherein the V segment is not recombined so as to 
be immediately adjacent to a D or J segment. 

Transgenic Nonhuman Animals Capable
of Producing Heterologous Antibodies

The design of a transgenic non-human animal that
responds to foreign antigen stimulation with a heterologous
antibody repertoire requires that the heterologous
immunoglobulin transgenes contained within the transgenic
animal function correctly throughout the pathway of B-cell
development. In a preferred embodiment, correct function of a
heterologous heavy chain transgene includes isotype switching.
Accordingly, the transgenes of the invention are constructed
so as to produce isotype switching and one or more of the
following: (1) high level and cell-type specific expression,
(2) functional gene rearrangement, (3) activation of and
response to allelic exclusion, (4) expression of a sufficient
primary repertoire, (5) signal transduction, (6) somatic
hypermutation, and (7) domination of the transgene antibody
locus during the immune response.

As will be apparent from the following disclosure,
not all of the foregoing criteria need be met. For example, in
those embodiments wherein the endogenous immunoglobulin loci
of the transgenic animal are functionally disrupted, the
transgene need not activate allelic exclusion. Further, in
those embodiments wherein the transgene comprises a
functionally rearranged heavy and/or light chain
immunoglobulin gene, the second criterion of functional gene
rearrangement is unnecessary, at least for that transgene
which is already rearranged. For background on molecular
immunology, see Fundamental Immunology, 2nd edition (1989),
Paul, William E., ed., Raven Press, N.Y., which is incorporated
herein by reference.

In one aspect of the invention, transgenic non-human
animals are provided that contain rearranged, unrearranged or
a combination of rearranged and unrearranged heterologous
immunoglobulin heavy and light chain transgenes in the
germline of the transgenic animal. Each of the heavy chain
transgenes comprises at least one CH gene. In addition, the
heavy chain transgene may contain functional isotype switch
sequences, which are capable of supporting isotype switching
of a heterologous transgene encoding multiple CH genes in
B-cells of the transgenic animal. Such switch sequences may be
those which occur naturally in the germline immunoglobulin
locus from the species that serves as the source of the
transgene CH genes, or such switch sequences may be derived
from those which occur in the species that is to receive the
transgene construct (the transgenic animal). For example, a
human transgene construct that is used to produce a transgenic
mouse may produce a higher frequency of isotype switching
events if it incorporates switch sequences similar to those
that occur naturally in the mouse heavy chain locus, as
presumably the mouse switch sequences are optimized to
function with the mouse switch recombinase enzyme system,
whereas the human switch sequences are not. Switch sequences
may be isolated and cloned by conventional cloning methods,
or may be synthesized de novo from overlapping synthetic
oligonucleotides designed on the basis of published sequence
information relating to immunoglobulin switch region sequences
(Mills et al., Nucl. Acids Res. 18:7305-7316 (1991);
Sideras et al., Intl. Immunol. 1:631-642 (1989), which are
incorporated herein by reference).

For each of the foregoing transgenic animals,
functionally rearranged heterologous heavy and light chain
immunoglobulin transgenes are found in a significant fraction
of the B-cells of the transgenic animal (at least 10 percent).

The transgenes of the invention include a heavy
chain transgene comprising DNA encoding at least one variable
gene segment, one diversity gene segment, one joining gene
segment and at least one constant region gene segment. The
immunoglobulin light chain transgene comprises DNA encoding at
least one variable gene segment, one joining gene segment and
at least one constant region gene segment. The gene segments
encoding the light and heavy chain gene segments are
heterologous to the transgenic non-human animal in that they
are derived from, or correspond to, DNA encoding
immunoglobulin heavy and light chain gene segments from a
species not consisting of the transgenic non-human animal. In
one aspect of the invention, the transgene is constructed such
that the individual gene segments are unrearranged, i.e., not
rearranged so as to encode a functional immunoglobulin light
or heavy chain. Such unrearranged transgenes support
recombination of the V, D, and J gene segments (functional
rearrangement) and preferably support incorporation of all or
a portion of a D region gene segment in the resultant
rearranged immunoglobulin heavy chain within the transgenic
non-human animal when exposed to antigen.

In an alternate embodiment, the transgenes comprise
an unrearranged "mini-locus". Such transgenes typically
comprise a substantial portion of the C, D, and J segments as
well as a subset of the V gene segments. In such transgene
constructs, the various regulatory sequences, e.g. promoters,
enhancers, class switch regions, splice-donor and
splice-acceptor sequences for RNA processing, recombination
signals and the like, comprise corresponding sequences derived
from the heterologous DNA. Such regulatory sequences may be
incorporated into the transgene from the same or a related
species of the non-human animal used in the invention. For
example, human immunoglobulin gene segments may be combined in
a transgene with a rodent immunoglobulin enhancer sequence for
use in a transgenic mouse. Alternatively, synthetic regulatory
sequences may be incorporated into the transgene, wherein such
synthetic regulatory sequences are not homologous to a
functional DNA sequence that is known to occur naturally in
the genomes of mammals. Synthetic regulatory sequences are
designed according to consensus rules, such as, for example,
those specifying the permissible sequences of a
splice-acceptor site or a promoter/enhancer motif.

The invention also includes transgenic animals
containing germ line cells having a heavy and light transgene
wherein one of the said transgenes contains rearranged gene
segments with the other containing unrearranged gene segments.
In the preferred embodiments, the rearranged transgene is a
light chain immunoglobulin transgene and the unrearranged
transgene is a heavy chain immunoglobulin transgene.

The Structure and Generation of Antibodies

The basic structure of all immunoglobulins is based 
upon a unit consisting of two light polypeptide chains and two 
heavy polypeptide chains. Each light chain comprises two 
regions known as the variable light chain region and the 
constant light chain region. Similarly, the immunoglobulin 
heavy chain comprises two regions designated the variable 
heavy chain region and the constant heavy chain region. 

The constant region for the heavy or light chain is
encoded by genomic sequences referred to as heavy or light
constant region gene (CH) segments. The use of a particular
heavy chain gene segment defines the class of immunoglobulin.
For example, in humans, the μ constant region gene segments
define the IgM class of antibody whereas the use of a γ1, γ2,
γ3 or γ4 constant region gene segment defines the IgG class of
antibodies as well as the IgG subclasses IgG1 through IgG4.
Similarly, the use of an α1 or α2 constant region gene segment
defines the IgA class of antibodies as well as the subclasses
IgA1 and IgA2. The δ and ε constant region gene segments
define the IgD and IgE antibody classes, respectively.
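
The correspondence between human heavy chain constant region gene segments and antibody classes described above can be written as a small lookup table. This is only an illustrative sketch; the spelled-out segment names ("mu", "gamma1", and so on) and the helper name are assumptions chosen for readability.

```python
# Human heavy chain constant region gene segment -> (class, subclass).
# Keys are spelled-out Greek letters; this naming is an illustrative choice.
CONSTANT_SEGMENT_TO_ISOTYPE = {
    "mu": ("IgM", None),
    "gamma1": ("IgG", "IgG1"),
    "gamma2": ("IgG", "IgG2"),
    "gamma3": ("IgG", "IgG3"),
    "gamma4": ("IgG", "IgG4"),
    "alpha1": ("IgA", "IgA1"),
    "alpha2": ("IgA", "IgA2"),
    "delta": ("IgD", None),
    "epsilon": ("IgE", None),
}


def isotype_of(segment):
    """Return the subclass if one is defined, otherwise the class."""
    ig_class, subclass = CONSTANT_SEGMENT_TO_ISOTYPE[segment.lower()]
    return subclass or ig_class


print(isotype_of("mu"))       # IgM
print(isotype_of("gamma3"))   # IgG3
```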

The variable regions of the heavy and light
immunoglobulin chains together contain the antigen binding
domain of the antibody. Because of the need for diversity in
this region of the antibody to permit binding to a wide range
of antigens, the DNA encoding the initial or primary
repertoire variable region comprises a number of different DNA
segments derived from families of specific variable region
gene segments. In the case of the light chain variable
region, such families comprise variable (V) gene segments and
joining (J) gene segments. Thus, the initial variable region
of the light chain is encoded by one V gene segment and one J
gene segment each selected from the family of V and J gene
segments contained in the genomic DNA of the organism. In the
case of the heavy chain variable region, the DNA encoding the
initial or primary repertoire variable region of the heavy
chain comprises one heavy chain V gene segment, one heavy
chain diversity (D) gene segment and one J gene segment, each
selected from the appropriate V, D and J families of
immunoglobulin gene segments in genomic DNA.
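
Because one segment is drawn from each family, the number of possible segment combinations is the product of the family sizes. The sketch below illustrates that arithmetic only; the function name is hypothetical, the segment counts in the usage lines are made-up numbers and not figures from this disclosure, and junctional variation is ignored.

```python
def combinatorial_repertoire(n_v, n_j, n_d=None):
    """Count segment combinations for one chain.

    Light chain: one V and one J segment     -> n_v * n_j
    Heavy chain: one V, one D, one J segment -> n_v * n_d * n_j
    """
    if n_d is None:
        return n_v * n_j
    return n_v * n_d * n_j


# Illustrative family sizes only (assumed, not taken from the text).
light = combinatorial_repertoire(n_v=4, n_j=5)
heavy = combinatorial_repertoire(n_v=2, n_j=6, n_d=10)
print(light, heavy, light * heavy)
```

Pairing any heavy chain with any light chain multiplies the two counts, which is why even a transgene with a modest number of segments can encode a sizeable primary repertoire.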

In order to increase the diversity of sequences that
contribute to forming antibody binding sites, it is preferable
that a heavy chain transgene include cis-acting sequences that
support functional V-D-J rearrangement that can incorporate
all or part of a D region gene sequence in a rearranged V-D-J
gene sequence. Typically, at least about 1 percent of
expressed transgene-encoded heavy chains (or mRNAs) include
recognizable D region sequences in the V region. Preferably,
at least about 10 percent of transgene-encoded V regions
include recognizable D region sequences, more preferably at
least about 30 percent, and most preferably more than 50
percent include recognizable D region sequences.




A recognizable D region sequence is generally at
least about eight consecutive nucleotides corresponding to a
sequence present in a D region gene segment of a heavy chain
transgene and/or the amino acid sequence encoded by such D
region nucleotide sequence. For example, if a transgene
includes the D region gene DHQ52, a transgene-encoded mRNA
containing the sequence 5'-TAACTGGG-3' located in the V region
between a V gene segment sequence and a J gene segment
sequence is recognizable as containing a D region sequence,
specifically a DHQ52 sequence. Similarly, for example, if a
transgene includes the D region gene DHQ52, a
transgene-encoded heavy chain polypeptide containing the amino
acid sequence -DAF- located in the V region between a V gene
segment amino acid sequence and a J gene segment amino acid
sequence is recognizable as containing a D region sequence,
specifically a DHQ52 sequence.
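
The "at least about eight consecutive nucleotides" rule for exact matches can be sketched as a search for the longest run shared between the V-to-J junction and a D segment. This is a minimal illustration, not a method taken from the disclosure: the function names are hypothetical, the junction sequences in the usage lines are invented, the caller is assumed to supply only the region lying between the V and J sequences, and the D segment string is the DHQ52 fragment quoted in this document. Matches altered by somatic mutation or N-region addition are not handled here.

```python
def longest_shared_run(junction, d_segment):
    """Length of the longest stretch of consecutive identical nucleotides
    shared by the junction and the D segment (brute-force search)."""
    best = 0
    for i in range(len(junction)):
        for j in range(len(d_segment)):
            k = 0
            while (i + k < len(junction) and j + k < len(d_segment)
                   and junction[i + k] == d_segment[j + k]):
                k += 1
            best = max(best, k)
    return best


def has_recognizable_d(junction, d_segment, min_run=8):
    """True if the junction contains at least min_run consecutive
    nucleotides that also occur consecutively in the D segment."""
    return longest_shared_run(junction.upper(), d_segment.upper()) >= min_run


DHQ52_FRAGMENT = "CTAACTGGG"  # fragment as quoted in the text

print(has_recognizable_d("GGTAACTGGGAC", DHQ52_FRAGMENT))  # contains TAACTGGG
print(has_recognizable_d("GGTAACTAGGAC", DHQ52_FRAGMENT))  # run interrupted
```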

However, because of somatic mutation and N-region
addition, some D region sequences may be recognizable but may
not correspond identically to a consecutive D region sequence
in the transgene. For example, a nucleotide sequence
5'-CTAAXTGGGG-3', where X is A, T, or G, and which is located
in a heavy chain V region and flanked by a V region gene
sequence and a J region gene sequence, can be recognized as
corresponding to the DHQ52 sequence 5'-CTAACTGGG-3'.
Similarly, for example, the polypeptide sequences -DAFDI-,
-DYFDY-, or -GAFDI- located in a V region and flanked on the
amino-terminal side by an amino acid sequence encoded by a
transgene V gene sequence and flanked on the carboxy-terminal
side by an amino acid sequence encoded by a transgene J gene
sequence are recognizable as D region sequences.

Therefore, because somatic mutation and N-region
addition can produce mutations in sequences derived from a
transgene D region, the following definition is provided as a
guide for determining the presence of a recognizable D region
sequence. An amino acid sequence or nucleotide sequence is
recognizable as a D region sequence if: (1) the sequence is
located in a V region and is flanked on one side by a V gene
sequence (nucleotide sequence or deduced amino acid sequence)
and on the other side by a J gene sequence (nucleotide
sequence or deduced amino acid sequence) and (2) the sequence
is substantially identical or substantially similar to a known
D gene sequence (nucleotide sequence or encoded amino acid
sequence).

The term "substantial identity" as used herein
denotes a characteristic of a polypeptide sequence or nucleic
acid sequence, wherein the polypeptide sequence has at least
50 percent sequence identity compared to a reference sequence,
and the nucleic acid sequence has at least 70 percent sequence
identity compared to a reference sequence. The percentage of
sequence identity is calculated excluding small deletions or
additions which total less than 35 percent of the reference
sequence. The reference sequence may be a subset of a larger
sequence, such as an entire D gene; however, the reference
sequence is at least 8 nucleotides long in the case of
polynucleotides, and at least 3 amino acid residues long in the
case of a polypeptide. Typically, the reference sequence is
at least 8 to 12 nucleotides or at least 3 to 4 amino acids,
and preferably the reference sequence is 12 to 15 nucleotides
or more, or at least 5 amino acids.

The term "substantial similarity" denotes a
characteristic of a polypeptide sequence, wherein the
polypeptide sequence has at least 80 percent similarity to a
reference sequence. The percentage of sequence similarity is
calculated by scoring identical amino acids or positional
conservative amino acid substitutions as similar. A
positional conservative amino acid substitution is one that
can result from a single nucleotide substitution; a first
amino acid is replaced by a second amino acid where a codon
for the first amino acid and a codon for the second amino acid
can differ by a single nucleotide substitution. Thus, for
example, the sequence -Lys-Glu-Arg-Val- is substantially
similar to the sequence -Asn-Asp-Ser-Val-, since the codon
sequence -AAA-GAA-AGA-GUU- can be mutated to -AAC-GAC-AGC-GUU-
by introducing only 3 substitution mutations, single
nucleotide substitutions in three of the four original codons.
The reference sequence may be a subset of a larger sequence,
such as an entire D gene; however, the reference sequence is
at least 4 amino acid residues long. Typically, the reference
sequence is at least 5 amino acids, and preferably the
reference sequence is 6 amino acids or more.
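
The similarity score defined above can be worked through in a short sketch that uses the standard genetic code. It is illustrative only: the function names are hypothetical, amino acids are written in one-letter code (K, E, R, V for Lys, Glu, Arg, Val) for brevity, codons use the DNA alphabet (T in place of U), and the two sequences are assumed to be already aligned and of equal length.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def codons_for(amino_acid):
    """All codons encoding the given one-letter amino acid."""
    return [c for c, a in CODON_TABLE.items() if a == amino_acid]


def single_nt_apart(aa1, aa2):
    """True if some codon for aa1 and some codon for aa2 differ at exactly
    one nucleotide position (a positional conservative substitution)."""
    return any(
        sum(x != y for x, y in zip(c1, c2)) == 1
        for c1 in codons_for(aa1)
        for c2 in codons_for(aa2)
    )


def percent_similarity(seq, ref):
    """Score identical residues and positional conservative substitutions
    as similar; return the percentage over the reference length."""
    if len(seq) != len(ref):
        raise ValueError("sequences must be aligned to equal length")
    similar = sum(a == b or single_nt_apart(a, b) for a, b in zip(seq, ref))
    return 100.0 * similar / len(ref)


# The example from the text: Lys-Glu-Arg-Val versus Asn-Asp-Ser-Val.
print(percent_similarity("KERV", "NDSV"))
# A contrasting made-up case: Lys to Trp needs two nucleotide changes.
print(percent_similarity("KERV", "WDSV"))
```

Under this scoring the example from the text reaches 100 percent similarity, while the contrasting case scores 75 percent and so falls below the 80 percent threshold.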


The Primary Repertoire 

The process for generating DNA encoding the heavy
and light chain immunoglobulin genes occurs primarily in
developing B-cells. Prior to the joining of various
immunoglobulin gene segments, the V, D, J and constant (C)
gene segments are found, for the most part, in clusters of V,
D, J and C gene segments in the precursors of primary
repertoire B-cells. Generally, all of the gene segments for a
heavy or light chain are located in relatively close proximity
on a single chromosome. Such genomic DNA prior to
recombination of the various immunoglobulin gene segments is
referred to herein as "unrearranged" genomic DNA. During
B-cell differentiation, one of each of the appropriate family
members of the V, D, J (or only V and J in the case of light
chain genes) gene segments are recombined to form functionally
rearranged heavy and light immunoglobulin genes. Such
functional rearrangement is of the variable region segments to
form DNA encoding a functional variable region. This gene
segment rearrangement process appears to be sequential.
First, heavy chain D-to-J joints are made, followed by heavy
chain V-to-DJ joints and light chain V-to-J joints. The DNA
encoding this initial form of a functional variable region in
a light and/or heavy chain is referred to as "functionally
rearranged DNA" or "rearranged DNA". In the case of the heavy
chain, such DNA is referred to as "rearranged heavy chain DNA"
and in the case of the light chain, such DNA is referred to as
"rearranged light chain DNA". Similar language is used to
describe the functional rearrangement of the transgenes of the
invention.

The recombination of variable region gene segments
to form functional heavy and light chain variable regions is
mediated by recombination signal sequences (RSS's) that flank
recombinationally competent V, D and J segments. RSS's
necessary and sufficient to direct recombination comprise a
dyad-symmetric heptamer, an AT-rich nonamer and an intervening
spacer region of either 12 or 23 base pairs. These signals
are conserved among the different loci and species that carry
out D-J (or V-J) recombination and are functionally
interchangeable. See Oettinger, et al. (1990), Science, 248,
1517-1523 and references cited therein. The heptamer
comprises the sequence CACAGTG or its analogue followed by a
spacer of unconserved sequence and then a nonamer having the
sequence ACAAAAACC or its analogue. These sequences are found
on the J, or downstream side, of each V and D gene segment.
Immediately preceding the germline D and J segments are again
two recombination signal sequences, first the nonamer and then
the heptamer, again separated by an unconserved sequence. The
heptameric and nonameric sequences following a VL, VH or D
segment are complementary to those preceding the JL, D or JH
segments with which they recombine. The spacers between the
heptameric and nonameric sequences are either 12 base pairs
long or between 22 and 24 base pairs long.
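
The heptamer, spacer and nonamer arrangement described above lends itself to a simple pattern search. The following is a minimal sketch under stated assumptions: it looks only for the exact consensus heptamer CACAGTG and nonamer ACAAAAACC in the orientation found downstream of a V or D segment, it does not recognize the analogues mentioned in the text or the complementary signals upstream of D and J segments, and the function name and test sequences are invented for illustration.

```python
import re

# Consensus heptamer, a spacer of 12 or 22-24 bp, then the consensus nonamer.
RSS_PATTERN = re.compile(
    r"CACAGTG(?P<spacer>[ACGT]{12}|[ACGT]{22,24})ACAAAAACC"
)


def find_rss(sequence):
    """Return (start_index, spacer_length) for each consensus RSS found."""
    return [
        (m.start(), len(m.group("spacer")))
        for m in RSS_PATTERN.finditer(sequence.upper())
    ]


# Made-up sequences: a 12 bp spacer, a 23 bp spacer, and a 15 bp spacer.
rss_12 = "TT" + "CACAGTG" + "GT" * 6 + "ACAAAAACC" + "GG"
rss_23 = "CACAGTG" + "GATC" * 5 + "GAT" + "ACAAAAACC"
rss_15 = "CACAGTG" + "GATCGATCGATCGAT" + "ACAAAAACC"

print(find_rss(rss_12))  # one hit with a 12 bp spacer
print(find_rss(rss_23))  # one hit with a 23 bp spacer
print(find_rss(rss_15))  # no hit: spacer length is not permitted
```

A practical scanner would need to tolerate deviations from the consensus and search both strands; the strict pattern here is only meant to make the spacer-length constraint concrete.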

In addition to the rearrangement of V, D and J
segments, further diversity is generated in the primary
repertoire of immunoglobulin heavy and light chain by way of
variable recombination between the V and J segments in the
light chain and between the D and J segments of the heavy
chain. Such variable recombination is generated by variation
in the exact place at which such segments are joined. Such
variation in the light chain typically occurs within the last
codon of the V gene segment and the first codon of the J
segment. Similar imprecision in joining occurs on the heavy
chain chromosome between the D and JH segments and may extend
over as many as 10 nucleotides. Furthermore, several
nucleotides may be inserted between the D and JH and between
the VH and D gene segments which are not encoded by genomic
DNA. The addition of these nucleotides is known as N-region
diversity.

After VJ and/or VDJ rearrangement, transcription of
the rearranged variable region and one or more constant region
gene segments located downstream from the rearranged variable
region produces a primary RNA transcript which upon
appropriate RNA splicing results in an mRNA which encodes a
full length heavy or light immunoglobulin chain. Such heavy
and light chains include a leader signal sequence to effect
secretion through and/or insertion of the immunoglobulin into
the transmembrane region of the B-cell. The DNA encoding this
signal sequence is contained within the first exon of the V
segment used to form the variable region of the heavy or light
immunoglobulin chain. Appropriate regulatory sequences are
also present in the mRNA to control translation of the mRNA to
produce the encoded heavy and light immunoglobulin
polypeptides which upon proper association with each other
form an antibody molecule.

The net effect of such rearrangements in the
variable region gene segments and the variable recombination
which may occur during such joining is the production of a
primary antibody repertoire. Generally, each B-cell which has
differentiated to this stage produces a single primary
repertoire antibody. During this differentiation process,
cellular events occur which suppress the functional
rearrangement of gene segments other than those contained
within the functionally rearranged Ig gene. The process by
which diploid B-cells maintain such mono-specificity is termed
allelic exclusion.



The Secondary Repertoire 

B-cell clones expressing immunoglobulins from within
the set of sequences comprising the primary repertoire are
immediately available to respond to foreign antigens. Because
of the limited diversity generated by simple VJ and VDJ
joining, the antibodies produced by the so-called primary
response are of relatively low affinity. Two different types
of B-cells make up this initial response: precursors of
primary antibody-forming cells and precursors of secondary
repertoire B-cells (Linton et al., Cell 59:1049-1059 (1989)).
The first type of B-cell matures into IgM-secreting plasma
cells in response to certain antigens. The other B-cells
respond to initial exposure to antigen by entering a T-cell
dependent maturation pathway.

During the T-cell dependent maturation of antigen
stimulated B-cell clones, the structure of the antibody
molecule on the cell surface changes in two ways: the constant
region switches to a non-IgM subtype and the sequence of the
variable region can be modified by multiple single amino acid
substitutions to produce a higher affinity antibody molecule.

As previously indicated, each variable region of a
heavy or light Ig chain contains an antigen binding domain.
It has been determined by amino acid and nucleic acid
sequencing that somatic mutation during the secondary response
occurs throughout the V region including the three
complementarity determining regions (CDR1, CDR2 and CDR3) also
referred to as hypervariable regions 1, 2 and 3 (Kabat et al.,
Sequences of Proteins of Immunological Interest (1991) U.S.
Department of Health and Human Services, Washington, DC,
incorporated herein by reference). The CDR1 and CDR2 are
located within the variable gene segment whereas the CDR3 is
largely the result of recombination between V and J gene
segments or V, D and J gene segments. Those portions of the
variable region which do not consist of CDR1, 2 or 3 are
commonly referred to as framework regions designated FR1, FR2,
FR3 and FR4. See Fig. 1. During hypermutation, the
rearranged DNA is mutated to give rise to new clones with
altered Ig molecules. Those clones with higher affinities for
the foreign antigen are selectively expanded by helper
T-cells, giving rise to affinity maturation of the expressed
antibody. Clonal selection typically results in expression of
clones containing new mutations within the CDR1, 2 and/or 3
regions. However, mutations outside these regions also occur
which influence the specificity and affinity of the antigen
binding domain.

Transgenic Non-Human Animals Capable
of Producing Heterologous Antibody

Transgenic non-human animals in one aspect of the
invention are produced by introducing at least one of the
immunoglobulin transgenes of the invention (discussed
hereinafter) into a zygote or early embryo of a non-human
animal. The non-human animals which are used in the invention
generally comprise any mammal which is capable of rearranging
immunoglobulin gene segments to produce a primary antibody
response. Such nonhuman transgenic animals may include, for
example, transgenic pigs, transgenic rats, transgenic rabbits,
transgenic cattle, and other transgenic animal species,
particularly mammalian species, known in the art. A
particularly preferred non-human animal is the mouse or other
members of the rodent family.

However, the invention is not limited to the use of
mice. Rather, any non-human mammal which is capable of
mounting a primary and secondary antibody response may be
used. Such animals include non-human primates, such as
chimpanzee, bovine, ovine, and porcine species, other members
of the rodent family, e.g. rat, as well as rabbit and guinea
pig. Particularly preferred animals are mouse, rat, rabbit and
guinea pig, most preferably mouse.

In one embodiment of the invention, various gene
segments from the human genome are used in heavy and light
chain transgenes in an unrearranged form. In this embodiment,
such transgenes are introduced into mice. The unrearranged
gene segments of the light and/or heavy chain transgene have
DNA sequences unique to the human species which are
distinguishable from the endogenous immunoglobulin gene
segments in the mouse genome. They may be readily detected in
unrearranged form in the germ line and somatic cells not
consisting of B-cells and in rearranged form in B-cells.

In an alternate embodiment of the invention, the
transgenes comprise rearranged heavy and/or light
immunoglobulin transgenes. Specific segments of such
transgenes corresponding to functionally rearranged VDJ or VJ
segments contain immunoglobulin DNA sequences which are also
clearly distinguishable from the endogenous immunoglobulin
gene segments in the mouse.

Such differences in DNA sequence are also reflected
in the amino acid sequence encoded by such human
immunoglobulin transgenes as compared to those encoded by
mouse B-cells. Thus, human immunoglobulin amino acid
sequences may be detected in the transgenic non-human animals
of the invention with antibodies specific for immunoglobulin
epitopes encoded by human immunoglobulin gene segments.

Transgenic B-cells containing unrearranged 
transgenes from human or other species functionally recombine 
the appropriate gene segments to form functionally rearranged 
light and heavy chain variable regions. It will be readily 
apparent that the antibody encoded by such rearranged 
transgenes has a DNA and/or amino acid sequence which is 
heterologous to that normally encountered in the nonhuman 
animal used to practice the invention. 



Unrearranged Transgenes

As used herein, an "unrearranged immunoglobulin
heavy chain transgene" comprises DNA encoding at least one
variable gene segment, one diversity gene segment, one joining
gene segment and one constant region gene segment. Each of
the gene segments of said heavy chain transgene is derived
from, or has a sequence corresponding to, DNA encoding
immunoglobulin heavy chain gene segments from a species not
consisting of the non-human animal into which said transgene
is introduced. Similarly, as used herein, an "unrearranged
immunoglobulin light chain transgene" comprises DNA encoding
at least one variable gene segment, one joining gene segment
and at least one constant region gene segment wherein each
gene segment of said light chain transgene is derived from, or
has a sequence corresponding to, DNA encoding immunoglobulin
light chain gene segments from a species not consisting of the
non-human animal into which said light chain transgene is
introduced.

Such heavy and light chain transgenes in this aspect
of the invention contain the above-identified gene segments in
an unrearranged form. Thus, interposed between the V, D and J
segments in the heavy chain transgene and between the V and J
segments on the light chain transgene are appropriate
recombination signal sequences (RSS's). In addition, such
transgenes also include appropriate RNA splicing signals to
join a constant region gene segment with the VJ or VDJ
rearranged variable region.

In order to facilitate isotype switching within a
heavy chain transgene containing more than one C region gene
segment, e.g. Cμ and Cγ1 from the human genome, as explained
below "switch regions" are incorporated upstream from each of
the constant region gene segments and downstream from the
variable region gene segments to permit recombination between
such constant regions to allow for immunoglobulin class
switching, e.g. from IgM to IgG. Such heavy and light
immunoglobulin transgenes also contain transcription control
sequences including promoter regions situated upstream from
the variable region gene segments which typically contain TATA
motifs. A promoter region can be defined approximately as a
DNA sequence that, when operably linked to a downstream
sequence, can produce transcription of the downstream
sequence. Promoters may require the presence of additional
linked cis-acting sequences in order to produce efficient
transcription. In addition, other sequences that participate
in the transcription of sterile transcripts are preferably
included. Examples of sequences that participate in
expression of sterile transcripts can be found in the
published literature, including Rothman et al., Intl. Immunol.
2:621-627 (1990); Reid et al., Proc. Natl. Acad. Sci. USA
86:840-844 (1989); Stavnezer et al., Proc. Natl. Acad. Sci.
USA 85:7704-7708 (1988); and Mills et al., Nucl. Acids Res.
18:7305-7316 (1991), each of which is incorporated herein by
reference. These sequences typically include about at least
50 bp immediately upstream of a switch region, preferably
about at least 200 bp upstream of a switch region; and more
preferably about at least 200-1000 bp or more upstream of a
switch region. Suitable sequences occur immediately upstream
of the human Sγ1, Sγ2, Sγ3, Sγ4, Sα1, Sα2, and Sε switch
regions, although the sequences immediately upstream of the
human Sγ1 and Sγ3 switch regions are preferable. In
particular, interferon (IFN) inducible transcriptional
regulatory elements, such as IFN-inducible enhancers, are
preferably included immediately upstream of transgene switch
sequences.

In addition to promoters, other regulatory sequences
which function primarily in B-lineage cells are used. Thus,
for example, a light chain enhancer sequence situated
preferably between the J and constant region gene segments on
the light chain transgene is used to enhance transgene
expression, thereby facilitating allelic exclusion. In the
case of the heavy chain transgene, regulatory enhancers are
also employed. Such regulatory sequences are used to maximize
the transcription and translation of the transgene so as to
induce allelic exclusion and to provide relatively high levels
of transgene expression.

Although the foregoing promoter and enhancer
regulatory control sequences have been generically described,
such regulatory sequences may be heterologous to the nonhuman
animal, being derived from the genomic DNA from which the
heterologous transgene immunoglobulin gene segments are
obtained. Alternately, such regulatory gene segments are
derived from the corresponding regulatory sequences in the
genome of the non-human animal, or closely related species,
which contains the heavy and light transgenes.

In the preferred embodiments, gene segments are
derived from human beings. The transgenic non-human animals
harboring such heavy and light transgenes are capable of
mounting an Ig-mediated immune response to a specific antigen
administered to such an animal. B-cells are produced within
such an animal which are capable of producing heterologous
human antibody. After immortalization, and the selection for
an appropriate monoclonal antibody (Mab), e.g. a hybridoma, a
source of therapeutic human monoclonal antibody is provided.
Such human Mabs have significantly reduced immunogenicity when
therapeutically administered to humans.

Although the preferred embodiments disclose the
construction of heavy and light transgenes containing human
gene segments, the invention is not so limited. In this
regard, it is to be understood that the teachings described
herein may be readily adapted to utilize immunoglobulin gene
segments from a species other than human beings. For example,
in addition to the therapeutic treatment of humans with the
antibodies of the invention, therapeutic antibodies encoded by
appropriate gene segments may be utilized to generate
monoclonal antibodies for use in the veterinary sciences.

Rearranged Transgenes 

In an alternative embodiment, transgenic nonhuman
animals contain at least one functionally rearranged
heterologous heavy chain immunoglobulin transgene in the
germline of the transgenic animal. Such animals contain
primary repertoire B-cells that express such rearranged heavy
transgenes. Such B-cells preferably are capable of undergoing
somatic mutation when contacted with an antigen to form a
heterologous antibody having high affinity and specificity for
the antigen. Said rearranged transgenes will contain at least
two C_H genes and the associated sequences required for isotype
switching.

The invention also includes transgenic animals
containing germ line cells having heavy and light transgenes
wherein one of the said transgenes contains rearranged gene
segments with the other containing unrearranged gene segments.
In such animals, the heavy chain transgenes shall have at
least two C_H genes and the associated sequences required for
isotype switching.

The invention further includes methods for
generating a synthetic variable region gene segment repertoire
to be used in the transgenes of the invention. The method
comprises generating a population of immunoglobulin V segment
DNAs wherein each of the V segment DNAs encodes an
immunoglobulin V segment and contains at each end a cleavage
recognition site of a restriction endonuclease. The
population of immunoglobulin V segment DNAs is thereafter
concatenated to form the synthetic immunoglobulin V segment
repertoire. Such synthetic variable region heavy chain
transgenes shall have at least two C_H genes and the associated
sequences required for isotype switching.

Isotype Switching

In the development of a B lymphocyte, the cell
initially produces IgM with a binding specificity determined
by the productively rearranged V_H and V_L regions.
Subsequently, each B cell and its progeny cells synthesize
antibodies with the same L and H chain V regions, but they may
switch the isotype of the H chain.

The use of μ or δ constant regions is largely
determined by alternate splicing, permitting IgM and IgD to be
coexpressed in a single cell. The other heavy chain isotypes
(γ, α, and ε) are only expressed natively after a gene
rearrangement event deletes the Cμ and Cδ exons. This gene
rearrangement process, termed isotype switching, typically
occurs by recombination between so-called switch segments
located immediately upstream of each heavy chain gene (except
δ). The individual switch segments are between 2 and 10 kb in
length, and consist primarily of short repeated sequences.
The exact point of recombination differs for individual class
switching events. Investigations which have used solution
hybridization kinetics or Southern blotting with cDNA-derived
C_H probes have confirmed that switching can be associated with
loss of C_H sequences from the cell.

The switch (S) region of the μ gene, Sμ, is located
about 1 to 2 kb 5' to the coding sequence and is composed of
numerous tandem repeats of sequences of the form
(GAGCT)n(GGGGT), where n is usually 2 to 5 but can range as
high as 17. (See T. Nikaido et al., Nature 292:845-848 (1981).)
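
As a rough illustration of the repeat structure just described, the following Python sketch scans a DNA string for units of the form (GAGCT)n(GGGGT). The function name find_switch_units and the toy sequence are hypothetical and are introduced only for this example. The upper bound of 17 on n is taken from the text; exact matching is a simplifying assumption, since naturally occurring switch repeats need not match the pentamers perfectly.

```python
import re

# One unit: GAGCT repeated n times (1 <= n <= 17), followed by GGGGT.
# The bound of 17 follows the text; exact matching is an assumption.
UNIT = re.compile(r"(?:GAGCT){1,17}GGGGT")


def find_switch_units(seq):
    """Return (start, n) for each non-overlapping (GAGCT)n(GGGGT) unit."""
    hits = []
    for match in UNIT.finditer(seq.upper()):
        # Each unit is n pentamers of GAGCT plus one pentamer of GGGGT.
        n = (len(match.group(0)) - 5) // 5
        hits.append((match.start(), n))
    return hits


# Toy sequence invented for illustration; it is not a real switch region.
toy = "TTT" + "GAGCT" * 3 + "GGGGT" + "GAGCT" * 2 + "GGGGT" + "AAA"
print(find_switch_units(toy))
```

Counting units and their n values in this way gives only a crude measure of how closely a candidate sequence resembles the tandem repeat structure reported for the μ switch region.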

Similar internally repetitive switch sequences
spanning several kilobases have been found 5' of the other C_H
genes. The Sα region has been sequenced and found to consist
of tandemly repeated 80-bp homology units, whereas Sγ2a, Sγ2b,
and Sγ3 all contain repeated 49-bp homology units very similar
to each other. (See, P. Szurek et al., J. Immunol. 135:620-626
(1985) and T. Nikaido et al., J. Biol. Chem. 257:7322-7329
(1982), which are incorporated herein by reference.) All the
sequenced S regions include numerous occurrences of the
pentamers GAGCT and GGGGT that are the basic repeated elements
of the Sμ gene (T. Nikaido et al., J. Biol. Chem. 257:7322-7329
(1982), which is incorporated herein by reference); in the
other S regions these pentamers are not precisely tandemly
repeated as in Sμ, but instead are embedded in larger repeat
units. The Sγ1 region has an additional higher-order
structure: two direct repeat sequences flank each of two
clusters of 49-bp tandem repeats. (See M. R. Mowatt et al.,
J. Immunol. 136:2674-2683 (1986), which is incorporated herein
by reference.)

Switch regions of human H chain genes have been
found to be very similar to their mouse homologs. Indeed,
similarity between pairs of human and mouse clones 5' to the
C_H genes has been found to be confined to the S regions, a fact
that confirms the biological significance of these regions.

A switch recombination between μ and α genes
produces a composite Sμ-Sα sequence. Typically, there is no
specific site, either in Sμ or in any other S region, where
the recombination always occurs.

Generally, unlike the enzymatic machinery of V-J
recombination, the switch machinery can apparently accommodate
different alignments of the repeated homologous regions of
germline S precursors and then join the sequences at different
positions within the alignment. (See, T. H. Rabbitts et al.,
Nucleic Acids Res. 9:4509-4524 (1981) and J. Ravetch et al.,
Proc. Natl. Acad. Sci. USA 77:6734-6738 (1980), which are
incorporated herein by reference.)

The exact details of the mechanism(s) of selective
activation of switching to a particular isotype are unknown.
Although exogenous influences such as lymphokines and
cytokines might upregulate isotype-specific recombinases, it
is also possible that the same enzymatic machinery catalyzes
switches to all isotypes and that specificity lies in
targeting this machinery to specific switch regions.

The T-cell-derived lymphokines IL-4 and IFNγ have
been shown to specifically promote the expression of certain
isotypes: IL-4 decreases IgM, IgG2a, IgG2b, and IgG3
expression and increases IgE and IgG1 expression; while IFNγ
selectively stimulates IgG2a expression and antagonizes the
IL-4-induced increase in IgE and IgG1 expression (Coffman et
al., J. Immunol. 136:949-954 (1986) and Snapper et al.,
Science 236:944-947 (1987), which are incorporated herein by
reference). A combination of IL-4 and IL-5 promotes IgA
expression (Coffman et al., J. Immunol. 139:3685-3690 (1987),
which is incorporated herein by reference).

Most of the experiments implicating T-cell effects 
on switching have not ruled out the possibility that the 
observed increase in cells with particular switch 
recombinations might reflect selection of preswitched or 
precommitted cells; but the most likely explanation is that 
the lymphokines actually promote switch recombination. 

Induction of class switching appears to be
associated with sterile transcripts ([citation illegible]
(1988); Stavnezer et al., Proc. Natl. Acad. Sci. USA 85:7704
(1988); Esser and Radbruch, [journal illegible] (1989);
[authors illegible], Proc. Natl. Acad. Sci. USA 86:2829 (1989);
Rothman et al., Intl. Immunol. 2:621 (1990), each of which is
incorporated by reference). For example, the observed
induction of the γ1 sterile transcript by IL-4 and inhibition
by IFN-γ correlates with the observation that IL-4 promotes
class switching to γ1 in B-cells in culture, while IFN-γ
inhibits γ1 expression. Therefore, the inclusion of
regulatory sequences that affect the transcription of sterile
transcripts may also affect the rate of isotype switching.
For example, increasing the transcription of a particular
sterile transcript typically can be expected to enhance the
frequency of isotype switch recombination involving adjacent
switch sequences.

For these reasons, it is preferable that transgenes
incorporate transcriptional regulatory sequences within about
1-2 kb upstream of each switch region that is to be utilized
for isotype switching. These transcriptional regulatory
sequences preferably include a promoter and an enhancer
element, and more preferably include the 5' flanking (i.e.,
upstream) region that is naturally associated (i.e., occurs in
germline configuration) with a switch region. This 5'
flanking region is typically about at least 50 nucleotides in
length, preferably about at least 200 nucleotides in length,
and more preferably at least 500-1000 nucleotides.

Although a 5' flanking sequence from one switch
region can be operably linked to a different switch region for
transgene construction (e.g., the 5' flanking sequence from
the human Sγ1 switch can be grafted immediately upstream of the
Sα1 switch), in some embodiments it is preferred that each
switch region incorporated in the transgene construct have the
5' flanking region that occurs immediately upstream in the
naturally occurring germline configuration.

Transgenic Primary Repertoire 
A. The Human Immunoglobulin Loci

An important requirement for transgene function is
the generation of a primary antibody repertoire that is
diverse enough to trigger a secondary immune response for a
wide range of antigens. The rearranged heavy chain gene
consists of a signal peptide exon, a variable region exon and
a tandem array of multi-domain constant region genes, each
of which is encoded by several exons. Each of the constant
region genes encodes the constant portion of a different class
of immunoglobulins. During B-cell development, V region
proximal constant regions are deleted, leading to the
expression of new heavy chain classes. For each heavy chain
class, alternative patterns of RNA splicing give rise to both
transmembrane and secreted immunoglobulins.

The human heavy chain locus consists of 
approximately 200 V gene segments spanning 2 Mb, approximately 
30 D gene segments spanning about 40 kb, six J segments 
clustered within a 3 kb span, and nine constant region gene 
segments spread out over approximately 300 kb. The entire 
locus spans approximately 2.5 Mb of the distal portion of the 
long arm of chromosome 14.

B. Gene Fragment Transgenes
1. Heavy Chain Transgene

In a preferred embodiment, immunoglobulin heavy and
light chain transgenes comprise unrearranged genomic DNA from
humans. In the case of the heavy chain, a preferred transgene
comprises a NotI fragment having a length between 670 and 830
kb. The length of this fragment is ambiguous because the 3'
restriction site has not been accurately mapped. It is known,
however, to reside between the α1 and ψα gene segments. This
fragment contains members of all six of the known V_H families,
the D and J gene segments, as well as the μ, δ, γ3, γ1 and α1
constant regions (Berman et al., EMBO J. 7:727-738 (1988),
which is incorporated herein by reference). A transgenic
mouse line containing this transgene correctly expresses a
heavy chain class required for B-cell development (IgM) and at
least one switched heavy chain class (IgG1), in conjunction
with a sufficiently large repertoire of variable regions to
trigger a secondary response for most antigens.



2. Light Chain Transgene

A genomic fragment containing all of the necessary 
gene segments and regulatory sequences from a human light 
chain locus may be similarly constructed. Such transgenes are 
constructed as described in the Examples. 

C. Transgenes Generated Intracellularly
by In Vivo Recombination

It is not necessary to isolate all or part of
the heavy chain locus on a single DNA fragment. Thus, for
example, the 670-830 kb NotI fragment from the human
immunoglobulin heavy chain locus may be formed in vivo in the
non-human animal during transgenesis. Such in vivo transgene
construction is produced by introducing two or more
overlapping DNA fragments into an embryonic nucleus of the
non-human animal. The overlapping portions of the DNA
fragments have DNA sequences which are substantially
homologous. Upon exposure to the recombinases contained
within the embryonic nucleus, the overlapping DNA fragments
homologously recombine in proper orientation to form the
670-830 kb NotI heavy chain fragment.

In vivo transgene construction can be used to form
any number of immunoglobulin transgenes which because of their
size are otherwise difficult, or impossible, to make or
manipulate by present technology. Thus, in vivo transgene
construction is useful to generate immunoglobulin transgenes
which are larger than DNA fragments which may be manipulated
by YAC vectors (Murray and Szostak, Nature 305:189-193
(1983)). Such in vivo transgene construction may be used to
introduce into a non-human animal substantially the entire
immunoglobulin loci from a species not consisting of the
transgenic non-human animal.

In addition to forming genomic immunoglobulin
transgenes, in vivo homologous recombination may also be
utilized to form "mini-locus" transgenes as described in the
Examples.

In the preferred embodiments utilizing in vivo
transgene construction, each overlapping DNA fragment
preferably has an overlapping substantially homologous DNA
sequence between the end portion of one DNA fragment and the
end portion of a second DNA fragment. Such overlapping
portions of the DNA fragments preferably comprise about 500 bp
to about 2000 bp, most preferably 1.0 kb to 2.0 kb.
Homologous recombination of overlapping DNA fragments to form
transgenes in vivo is further described in commonly assigned
PCT Publication No. WO 92/03917 entitled "Homologous
Recombination in Mammalian Cells" published March 19, 1992.
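
The overlap requirement described above can be sketched in a few lines of Python. This is only a toy model under stated assumptions: it treats "substantially homologous" as exact identity, uses the 500 bp and 2000 bp figures from the text as default bounds, and says nothing about how recombination proceeds in an embryonic nucleus. The names shared_overlap and join_fragments, and the random test sequence, are hypothetical.

```python
import random


def shared_overlap(left, right, min_len=500, max_len=2000):
    """Longest suffix of `left` equal to a prefix of `right`.

    Only lengths within [min_len, max_len] are considered; returns 0
    when no such overlap exists. Exact identity is an assumption.
    """
    upper = min(len(left), len(right), max_len)
    for k in range(upper, min_len - 1, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


def join_fragments(fragments, min_len=500, max_len=2000):
    """Join ordered fragments through their end overlaps."""
    assembled = fragments[0]
    for frag in fragments[1:]:
        k = shared_overlap(assembled, frag, min_len, max_len)
        if k == 0:
            raise ValueError("adjacent fragments lack a qualifying overlap")
        assembled += frag[k:]  # append only the non-overlapping part
    return assembled


# Invented 3000-base sequence split into two fragments sharing 600 bases.
rng = random.Random(0)
locus = "".join(rng.choice("ACGT") for _ in range(3000))
frag_a, frag_b = locus[:1800], locus[1200:]
print(shared_overlap(frag_a, frag_b), join_fragments([frag_a, frag_b]) == locus)
```

A check of this kind could be run on candidate fragment sets before microinjection to confirm that every adjacent pair shares an overlap in the preferred size range.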

D. Minilocus Transgenes

As used herein, the term "immunoglobulin minilocus"
refers to a DNA sequence (which may be within a longer
sequence), usually of less than about 150 kb, typically
between about 25 and 100 kb, containing at least one each of
the following: a functional variable (V) gene segment, a
functional joining (J) region segment, at least one functional
constant (C) region gene segment, and, if it is a heavy chain
minilocus, a functional diversity (D) region segment, such
that said DNA sequence contains at least one substantial
discontinuity (e.g., a deletion, usually of at least about 2
to 5 kb, preferably 10-25 kb or more, relative to the
homologous genomic DNA sequence). A light chain minilocus
transgene will be at least 25 kb in length, typically 50 to 60
kb. A heavy chain transgene will typically be about 70 to 80
kb in length, preferably at least about 60 kb with two
constant regions operably linked to switch regions.
Furthermore, the individual elements of the minilocus are
preferably in the germline configuration and capable of
undergoing gene rearrangement in the pre-B cell of a
transgenic animal so as to express functional antibody
molecules with diverse antigen specificities encoded entirely
by the elements of the minilocus. Further, a heavy chain
minilocus comprising at least two C_H genes and the requisite
switching sequences is typically capable of undergoing isotype
switching, so that functional antibody molecules of different
immunoglobulin classes will be generated. Such isotype
switching may occur in vivo in B-cells residing within the
transgenic nonhuman animal, or may occur in cultured cells of
the B-cell lineage which have been explanted from the
transgenic nonhuman animal.

In an alternate preferred embodiment, immunoglobulin
heavy chain transgenes comprise one or more of each of the V_H,
D, and J_H gene segments and two or more of the C_H genes. At
least one of each appropriate type gene segment is
incorporated into the minilocus transgene. With regard to the
C_H segments for the heavy chain transgene, it is preferred
that the transgene contain at least one μ gene segment and at
least one other constant region gene segment, more preferably
a γ gene segment, and most preferably γ3 or γ1. This
preference is to allow for class switching between IgM and IgG
forms of the encoded immunoglobulin and the production of a
secretable form of high affinity non-IgM immunoglobulin.
Other constant region gene segments may also be used such as
those which encode for the production of IgD, IgA and IgE.

Those skilled in the art will also construct
transgenes wherein the order of occurrence of heavy chain C_H
genes will be different from the naturally-occurring spatial
order found in the germline of the species serving as the
donor of the C_H genes.

Additionally, those skilled in the art can select C_H
genes from more than one individual of a species (e.g.,
allogeneic C_H genes) and incorporate said genes in the
transgene as supernumerary C_H genes capable of undergoing
isotype switching; the resultant transgenic nonhuman animal
may then, in some embodiments, make antibodies of various
classes including all of the allotypes represented in the
species from which the transgene C_H genes were obtained.

Still further, those skilled in the art can select
C_H genes from different species to incorporate into the
transgene. Functional switch sequences are included with each
C_H gene, although the switch sequences used are not
necessarily those which occur naturally adjacent to the C_H
gene. Interspecies C_H gene combinations will produce a
transgenic nonhuman animal which may produce antibodies of
various classes corresponding to C_H genes from various
species. Transgenic nonhuman animals containing interspecies
C_H transgenes may serve as the source of B-cells for
constructing hybridomas to produce monoclonals for veterinary
uses.

The heavy chain J region segments in the human
comprise six functional J segments and three pseudogenes
clustered in a 3 kb stretch of DNA. Given its relatively
compact size and the ability to isolate these segments
together with the μ gene and the 5' portion of the δ gene on a
single 23 kb SfiI/SpeI fragment (Sado et al., Biochem.
Biophys. Res. Comm. 154:264-271 (1988), which is incorporated
herein by reference), it is preferred that all of the J region
gene segments be used in the mini-locus construct. Since this
fragment spans the region between the μ and δ genes, it is
likely to contain all of the 3' cis-linked regulatory elements
required for μ expression. Furthermore, because this fragment
includes the entire J region, it contains the heavy chain
enhancer and the μ switch region (Mills et al., Nature 306:809
(1983); Yancopoulos and Alt, Ann. Rev. Immunol. 4:339-368
(1986), which are incorporated herein by reference). It also
contains the transcription start sites which trigger VDJ
joining to form primary repertoire B-cells (Yancopoulos and
Alt, Cell 40:271-281 (1985), which is incorporated herein by
reference). Alternatively, a 36 kb BssHII/SpeI fragment,
which includes part of the D region, may be used in place of
the 23 kb SfiI/SpeI fragment. The use of such a fragment
increases the amount of 5' flanking sequence to facilitate
efficient D-to-J joining.

The human D region consists of 4 or 5 homologous 9
kb subregions, linked in tandem (Siebenlist, et al. (1981),
Nature, 294, 631-635). Each subregion contains up to 10
individual D segments. Some of these segments have been
mapped and are shown in Fig. 4. Two different strategies are
used to generate a mini-locus D region. The first strategy
involves using only those D segments located in a short
contiguous stretch of DNA that includes one or two of the
repeated D subregions. A candidate is a single 15 kb fragment
that contains 12 individual D segments. This piece of DNA
consists of 2 contiguous EcoRI fragments and has been
completely sequenced (Ichihara, et al. (1988), EMBO J., 7,
4141-4150). Twelve D segments should be sufficient for a
primary repertoire. However, given the dispersed nature of
the D region, an alternative strategy is to ligate together
several non-contiguous D-segment containing fragments, to
produce a smaller piece of DNA with a greater number of
segments. Additional D-segment genes can be identified, for
example, by the presence of characteristic flanking nonamer
and heptamer sequences, supra, and by reference to the
literature.
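
One way to picture the identification of D segments by their flanking nonamer and heptamer sequences is the following Python sketch. It is a minimal illustration under several assumptions that are not taken from this document: the commonly cited consensus heptamer CACAGTG and nonamer ACAAAAACC, 12 bp spacers on both sides of the D coding sequence, exact matching to the consensus, and an arbitrary coding length window of 5 to 40 bp. The function name find_candidate_d_segments and the toy sequence are hypothetical. Genuine signal sequences often deviate from the consensus, so a practical search would need to tolerate mismatches.

```python
import re

HEPTAMER = "CACAGTG"    # commonly cited consensus heptamer (assumption)
NONAMER = "ACAAAAACC"   # commonly cited consensus nonamer (assumption)


def revcomp(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Layout assumed on the coding strand:
# nonamer(rc) - 12 bp - heptamer(rc) - D coding - heptamer - 12 bp - nonamer
D_PATTERN = re.compile(
    revcomp(NONAMER) + "[ACGT]{12}" + revcomp(HEPTAMER)
    + "([ACGT]{5,40}?)"            # candidate D coding sequence
    + HEPTAMER + "[ACGT]{12}" + NONAMER
)


def find_candidate_d_segments(seq):
    """Return (start, coding_sequence) for each exact consensus match."""
    return [(m.start(1), m.group(1)) for m in D_PATTERN.finditer(seq.upper())]


# Toy sequence invented for illustration only.
toy = ("AT" * 5 + revcomp(NONAMER) + "A" * 12 + revcomp(HEPTAMER)
       + "GGTATAACTGGAAC"
       + HEPTAMER + "T" * 12 + NONAMER + "GC" * 5)
print(find_candidate_d_segments(toy))
```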

At least one, and preferably more than one V gene
segment is used to construct the heavy chain minilocus
transgene. Rearranged or unrearranged V segments with or
without flanking sequences can be isolated as described in PCT
Publication No. WO 92/03918, published March 19, 1992,
entitled "Transgenic Non-Human Animals Capable of Producing
Heterologous Antibodies."




Rearranged or unrearranged V segments, D segments, J
segments, and C genes, with or without flanking sequences, can
be isolated as described in PCT Publication No. WO 92/03918,
published March 19, 1992.

A minilocus light chain transgene may be similarly
constructed from the human λ or κ immunoglobulin locus.
Thus, for example, an immunoglobulin heavy chain minilocus
transgene construct, e.g., of about 75 kb, encoding V, D, J
and constant region sequences can be formed from a plurality
of DNA fragments, with each sequence being substantially
homologous to human gene sequences. Preferably, the sequences
are operably linked to transcription regulatory sequences and
are capable of undergoing rearrangement. With two or more
appropriately placed constant region sequences (e.g., μ and γ)
and switch regions, switch recombination also occurs. An
exemplary light chain transgene construct can be formed
similarly from a plurality of DNA fragments, substantially
homologous to human DNA and capable of undergoing
rearrangement.

E. Transgene Constructs Capable of Isotype Switching

Ideally, transgene constructs that are intended to
undergo class switching should include all of the cis-acting
sequences necessary to regulate sterile transcripts.
Naturally occurring switch regions and upstream promoters and
regulatory sequences (e.g., IFN-inducible elements) are
preferred cis-acting sequences that are included in transgene
constructs capable of isotype switching. About at least 50
basepairs, preferably about at least 200 basepairs, and more
preferably at least 500 to 1000 basepairs or more of sequence
immediately upstream of a switch region, preferably a human γ1
switch region, should be operably linked to a switch sequence,
preferably a human γ1 switch sequence. Further, switch
regions can be linked upstream of (and adjacent to) C_H genes
that do not naturally occur next to the particular switch
region. For example, but not for limitation, a human γ1
switch region may be linked upstream from a human α2 C_H gene,
or a murine γ1 switch may be linked to a human C_H gene.

An alternative method for obtaining non-classical
isotype switching (e.g., δ-associated deletion) in transgenic
mice involves the inclusion of the 400 bp direct repeat
sequences (σμ and εμ) that flank the human μ gene (Yasui et
al., Eur. J. Immunol. 19:1399 (1989)). Homologous
recombination between these two sequences deletes the μ gene
in IgD-only B-cells. Heavy chain transgenes can be
represented by the following formulaic description:

(V_H)_x-(D)_y-(J_H)_z-(S_D)_m-(C_1)_n-[(T)-(S_A)_p-(C_2)]_q

where:

V_H is a heavy chain variable region gene segment,
D is a heavy chain D (diversity) region gene segment,
J_H is a heavy chain J (joining) region gene segment,
S_D is a donor region segment capable of participating in
    a recombination event with the S_A acceptor region
    segments such that isotype switching occurs,
C_1 is a heavy chain constant region gene segment encoding
    an isotype utilized in B cell development (e.g.,
    μ or δ),
T is a cis-acting transcriptional regulatory region
    segment containing at least a promoter,
S_A is an acceptor region segment capable of participating
    in a recombination event with selected S_D donor
    region segments, such that isotype switching occurs,
C_2 is a heavy chain constant region gene segment encoding
    an isotype other than μ (e.g., γ1, γ2, γ3, γ4, α1,
    α2, ε),
x, y, z, m, n, p, and q are integers. x is 1-100, n is
    0-10, y is 1-50, p is 1-10, z is 1-50, q is 0-50, m
    is 0-10. Typically, when the transgene is capable
    of isotype switching, q must be at least 1, m is at
    least 1, n is at least 1, and m is greater than or
    equal to n.

V_H, D, J_H, S_D, C_1, T, S_A, and C_2 segments may be
selected from various species, preferably mammalian species,
and more preferably from human and murine germline DNA.
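
The integer constraints attached to the formula can be restated as a small Python check, offered here only as a sketch. The class name HeavyChainTransgene and the helper functions are hypothetical. The ranges are those given in the text, except that the upper bound for p is hard to read in the source and is assumed here to be 10.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HeavyChainTransgene:
    x: int  # number of V_H segments
    y: int  # number of D segments
    z: int  # number of J_H segments
    m: int  # number of S_D donor regions
    n: int  # number of C_1 constant region genes
    p: int  # number of S_A acceptor regions per bracketed unit
    q: int  # number of bracketed [T-S_A-C_2] units


# Ranges as stated in the text; the upper bound for p is an assumption.
RANGES = {
    "x": (1, 100), "y": (1, 50), "z": (1, 50),
    "m": (0, 10), "n": (0, 10), "p": (1, 10), "q": (0, 50),
}


def in_stated_ranges(t):
    """True when every count lies inside its stated range."""
    return all(lo <= getattr(t, name) <= hi for name, (lo, hi) in RANGES.items())


def can_isotype_switch(t):
    """Apply the stated conditions: q >= 1, m >= 1, n >= 1 and m >= n."""
    return in_stated_ranges(t) and t.q >= 1 and t.m >= 1 and t.n >= 1 and t.m >= t.n


example = HeavyChainTransgene(x=4, y=10, z=6, m=1, n=1, p=1, q=1)
print(in_stated_ranges(example), can_isotype_switch(example))
```

Under these assumptions, a construct with no bracketed unit (q = 0) falls within the stated ranges but fails the switching conditions.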

V_H segments may be selected from various species,
but are preferably selected from V_H segments that occur
naturally in the human germline, such as V_H251. Typically
about 2 V_H gene segments are included, preferably about 4 V_H
segments are included, and most preferably at least about 10
V_H segments are included.

At least one D segment is typically included,
although at least 10 D segments are preferably included, and
some embodiments include more than ten D segments. Some
preferred embodiments include human D segments.

Typically at least one J_H segment is incorporated in
the transgene, although it is preferable to include about six
J_H segments, and some preferred embodiments include more than
about six J_H segments. Some preferred embodiments include
human J_H segments, and further preferred embodiments include
six human J_H segments and no nonhuman J_H segments.

S_D segments are donor regions capable of
participating in recombination events with the S_A segment of
the transgene. For classical isotype switching, S_D and S_A are
switch regions such as Sμ, Sγ1, Sγ2, Sγ3, Sγ4, Sα, Sα2, and Sε.
Preferably the switch regions are murine or human, more
preferably S_D is a human or murine Sμ and S_A is a human or
murine Sγ1. For nonclassical isotype switching (δ-associated
deletion), S_D and S_A are preferably the 400 basepair direct
repeat sequences that flank the human μ gene.

C_1 segments are typically μ or δ genes, preferably a
μ gene, and more preferably a human or murine μ gene.

T segments typically include 5' flanking sequences
that are adjacent to naturally occurring (i.e., germline)
switch regions. T segments are typically at least about
50 nucleotides in length, preferably at least about 200
nucleotides in length, and more preferably at least 500-1000
nucleotides in length. Preferably T segments are 5' flanking
sequences that occur immediately upstream of human or murine
switch regions in a germline configuration. It is also
evident to those of skill in the art that T segments may
comprise cis-acting transcriptional regulatory sequences that
do not occur naturally in an animal germline (e.g., viral
enhancers and promoters such as those found in SV40,
adenovirus, and other viruses that infect eukaryotic cells).

C_2 segments are typically a γ1, γ2, γ3, γ4, α1, α2,
or ε C_H gene, preferably a human C_H gene of these isotypes, and
more preferably a human γ1 or γ3 gene. Murine γ2a and γ2b may
also be used, as may downstream (i.e., switched) isotype genes
from various species. Where the heavy chain transgene
contains an immunoglobulin heavy chain minilocus, the total
length of the transgene will be typically 150 kilo basepairs
or less.

In general, the transgene will be other than a
native heavy chain Ig locus. Thus, for example, deletion of
unnecessary regions or substitutions with corresponding
regions from other species will be present.

F. Methods for Determining Functional
Isotype Switching in Ig Transgenes

The occurrence of isotype switching in a transgenic
nonhuman animal may be identified by any method known to those
in the art. Preferred embodiments include the following,
employed either singly or in combination:

1. detection of mRNA transcripts that contain a sequence
homologous to at least one transgene downstream C_H gene other
than δ and an adjacent sequence homologous to a transgene
V_H-D_H-J_H rearranged gene; such detection may be by Northern
hybridization, S1 nuclease protection assays, PCR
amplification, cDNA cloning, or other methods;

2. detection in the serum of the transgenic animal, or in
supernatants of cultures of hybridoma cells made from B-cells
of the transgenic animal, of immunoglobulin proteins encoded
by downstream C_H genes, where such proteins can also be shown
by immunochemical methods to comprise a functional variable
region;

3. detection, in DNA from B-cells of the transgenic
animal or in genomic DNA from hybridoma cells, of DNA
rearrangements consistent with the occurrence of isotype
switching in the transgene; such detection may be accomplished
by Southern blot hybridization, PCR amplification, genomic
cloning, or other method; or

4. identification of other indicia of isotype switching,
such as production of sterile transcripts, production of
characteristic enzymes involved in switching (e.g., "switch
recombinase"), or other manifestations that may be detected,
measured, or observed by contemporary techniques.

Because each transgenic line may represent a
different site of integration of the transgene, and a
potentially different tandem array of transgene inserts, and
because each different configuration of transgene and flanking
DNA sequences can affect gene expression, it is preferable to
identify and use lines of mice that express high levels of
human immunoglobulins, particularly of the IgG isotype, and
contain the least number of copies of the transgene. Single
copy transgenics minimize the potential problem of incomplete
allelic expression. Transgenes are typically integrated into
host chromosomal DNA, most usually into germline DNA and
propagated by subsequent breeding of germline transgenic
breeding stock animals. However, other vectors and transgenic
methods known in the present art or subsequently developed may
be substituted as appropriate and as desired by a
practitioner.

G. Functional Disruption of 

Endogenous Immunoglobulin Loci 

The expression of successfully rearranged
immunoglobulin heavy and light transgenes is expected to have
a dominant effect by suppressing the rearrangement of the
endogenous immunoglobulin genes in the transgenic nonhuman
animal. However, another way to generate a nonhuman animal
that is devoid of endogenous antibodies is by mutating the
endogenous immunoglobulin loci. Using embryonic stem cell
technology and homologous recombination, the endogenous
immunoglobulin repertoire can be readily eliminated. The
following describes the functional disruption of the mouse
immunoglobulin loci. The vectors and methods disclosed,
however, can be readily adapted for use in other non-human
animals.

Briefly, this technology involves the inactivation
of a gene, by homologous recombination, in a pluripotent cell
line that is capable of differentiating into germ cell tissue.
A DNA construct that contains an altered copy of a mouse
immunoglobulin gene is introduced into the nuclei of embryonic
stem cells. In a portion of the cells, the introduced DNA
recombines with the endogenous copy of the mouse gene,
replacing it with the altered copy. Cells containing the
newly engineered genetic lesion are injected into a host mouse
embryo, which is reimplanted into a recipient female. Some of
these embryos develop into chimeric mice that possess germ
cells entirely derived from the mutant cell line. Therefore,
by breeding the chimeric mice it is possible to obtain a new
line of mice containing the introduced genetic lesion
(reviewed by Capecchi (1989), Science, 244: 1288-1292).

Because the mouse λ locus contributes to only 5% of
the immunoglobulins, inactivation of the heavy chain and/or
κ light chain loci is sufficient. There are three ways to
disrupt each of these loci: deletion of the J region, deletion
of the J-C intron enhancer, and disruption of constant region
coding sequences by the introduction of a stop codon. The
last option is the most straightforward in terms of DNA
construct design. Elimination of the μ gene disrupts B-cell
maturation, thereby preventing class switching to any of the
functional heavy chain segments. The strategy for knocking
out these loci is outlined below.

To disrupt the mouse μ and κ genes, targeting
vectors are used based on the design employed by Jaenisch and
co-workers (Zijlstra, et al. (1989), Nature, 342, 435-438) for
the successful disruption of the mouse β2-microglobulin gene.
The neomycin resistance gene (neo), from the plasmid pMC1neo,
is inserted into the coding region of the target gene. The
pMC1neo insert uses a hybrid viral promoter/enhancer sequence
to drive neo expression. This promoter is active in embryonic
stem cells. Therefore, neo can be used as a selectable marker
for integration of the knock-out construct. The HSV thymidine
kinase (tk) gene is added to the end of the construct as a
negative selection marker against random insertion events
(Zijlstra, et al., supra).

A preferred strategy for disrupting the heavy chain
locus is the elimination of the J region. This region is
fairly compact in the mouse, spanning only 1.3 kb. To
construct a gene targeting vector, a 15 kb KpnI fragment
containing all of the secreted μ constant region exons is
isolated from a mouse genomic library. The 1.3 kb J region is
replaced with the 1.1 kb insert from pMC1neo. The HSV tk gene
is then added to the 5' end of the KpnI fragment. Correct
integration of this construct, via homologous recombination,
will result in the replacement of the mouse J_H region with the
neo gene. Recombinants are screened by PCR, using a primer
based on the neo gene and a primer homologous to mouse
sequences 5' of the KpnI site in the D region.
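
The size bookkeeping for this replacement vector can be sketched as follows. This is only an illustration using the fragment sizes quoted above; the function name is hypothetical, and the HSV tk cassette is left out because its size is not given in the text.

```python
def replaced_fragment_kb(fragment_kb, deleted_kb, insert_kb):
    """Length (kb) of a fragment after a deleted segment is swapped for an insert."""
    if deleted_kb > fragment_kb:
        raise ValueError("deleted segment is longer than the fragment")
    return round(fragment_kb - deleted_kb + insert_kb, 3)

# 15 kb genomic fragment, 1.3 kb J region removed, 1.1 kb neo insert added.
targeting_fragment_kb = replaced_fragment_kb(15.0, 1.3, 1.1)
print(targeting_fragment_kb)
```

Under these figures the modified fragment is slightly shorter than the original, since the neo insert is 0.2 kb smaller than the J region it replaces.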

Alternatively, the heavy-chain locus is knocked out
by disrupting the coding region of the μ gene. This approach
involves the same 15 kb KpnI fragment used in the previous
approach. The 1.1 kb insert from pMC1neo is inserted at a
unique BamHI site in exon II, and the HSV tk gene added to the
3' KpnI end. Double crossover events on either side of the
neo insert, that eliminate the tk gene, are then selected for.
These are detected from pools of selected clones by PCR
amplification. One of the PCR primers is derived from neo
sequences and the other from mouse sequences outside of the
targeting vector. The functional disruption of the mouse
immunoglobulin loci is presented in the Examples.


G. Suppressing Expression of 

Endogenous Immunoglobulin Loci 

In addition to functional disruption of endogenous
Ig loci, an alternative method for preventing the expression
of an endogenous Ig locus is suppression. Suppression of
endogenous Ig genes may be accomplished with antisense RNA
produced from one or more integrated transgenes, by antisense
oligonucleotides, and/or by administration of antisera
specific for one or more endogenous Ig chains.

Antisense Polynucleotides

Antisense RNA transgenes can be employed to
partially or totally knock-out expression of specific genes
(Pepin et al. (1991) Nature 355: 725; Helene, C. and Toulme,
J. (1990) Biochimica Biophys. Acta 1049: 99; Stout, J. and
Caskey, T. (1990) Somat. Cell Mol. Genet. 16: 369; Munir et
al. (1990) Somat. Cell Mol. Genet. 16: 383, each of which is
incorporated herein by reference).

"Antisense polynucleotides" are polynucleotides
that: (1) are complementary to all or part of a reference
sequence, such as a sequence of an endogenous Ig C_H or C_L
region, and (2) specifically hybridize to a
complementary target sequence, such as a chromosomal gene
locus or an Ig mRNA. Such complementary antisense
polynucleotides may include nucleotide substitutions,
additions, deletions, or transpositions, so long as specific
hybridization to the relevant target sequence is retained as a
functional property of the polynucleotide. Complementary
antisense polynucleotides include soluble antisense RNA or DNA
oligonucleotides which can hybridize specifically to
individual mRNA species and prevent transcription and/or RNA
processing of the mRNA species and/or translation of the
encoded polypeptide (Ching et al., Proc. Natl. Acad. Sci.
U.S.A. 86:10006-10010 (1989); Broder et al., Ann. Int. Med.
113:604-618 (1990); Loreau et al., FEBS Letters 274:53-56
(1990); Holcenberg et al., WO91/11535; WO91/09865; WO91/04753;
WO90/13641; and EP 386563, each of which is incorporated
herein by reference). An antisense sequence is a
polynucleotide sequence that is complementary to at least one
immunoglobulin gene sequence of at least about 15 contiguous
nucleotides in length, typically at least 20 to 30 nucleotides
in length, and preferably more than about 30 nucleotides in
length. However, in some embodiments, antisense sequences may
have substitutions, additions, or deletions as compared to the
complementary immunoglobulin gene sequence, so long as
specific hybridization is retained as a property of the
antisense polynucleotide. Generally, an antisense sequence is
complementary to an endogenous immunoglobulin gene sequence
that encodes, or has the potential to encode after DNA
rearrangement, an immunoglobulin chain. In some cases, sense
sequences corresponding to an immunoglobulin gene sequence may
function to suppress expression, particularly by interfering
with transcription.
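
As an illustration of the complementarity and length criteria above, the sketch below lists candidate antisense oligonucleotides against a target strand. It is a simplified model only: the target sequence is a made-up placeholder rather than a sequence from this disclosure, the function names are hypothetical, and perfect Watson-Crick complementarity stands in for "specific hybridization".

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def antisense_candidates(target, length=15):
    """Yield every antisense oligo (5'->3') of the given length against target."""
    if length < 15:
        # The text calls for at least about 15 contiguous nucleotides.
        raise ValueError("length should be at least 15 nucleotides")
    for start in range(len(target) - length + 1):
        yield reverse_complement(target[start:start + length])

# Placeholder 21-nt target, chosen only for demonstration.
target = "ATGGAGTTTGGGCTGAGCTGG"
oligos = list(antisense_candidates(target, length=15))
```

Each candidate, when reverse-complemented again, returns the window of the target it is directed against, which is a quick sanity check on the construction.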

The antisense polynucleotides therefore inhibit
production of the encoded polypeptide(s). In this regard,
antisense polynucleotides that inhibit transcription and/or
translation of one or more endogenous Ig loci can alter the
capacity and/or specificity of a non-human animal to produce
immunoglobulin chains encoded by endogenous Ig loci.

Antisense polynucleotides may be produced from a
heterologous expression cassette in a transfectant cell or
transgenic cell, such as a transgenic pluripotent
hematopoietic stem cell used to reconstitute all or part of
the hematopoietic stem cell population of an individual, or a
transgenic nonhuman animal. Alternatively, the antisense
polynucleotides may comprise soluble oligonucleotides that are
administered to the external milieu, either in culture medium
in vitro or in the circulatory system or interstitial fluid in
vivo. Soluble antisense polynucleotides present in the
external milieu have been shown to gain access to the
cytoplasm and inhibit translation of specific mRNA species. In
some embodiments the antisense polynucleotides comprise
methylphosphonate moieties; alternatively, phosphorothiolates
or O-methylribonucleotides may be used, and chimeric
oligonucleotides may also be used (Dagle et al. (1990) Nucleic
Acids Res. 18: 4751). For some applications, antisense
oligonucleotides may comprise polyamide nucleic acids (Nielsen
et al. (1991) Science 254: 1497). For general methods
relating to antisense polynucleotides, see Antisense RNA and
DNA (1988), D. A. Melton, Ed., Cold Spring Harbor Laboratory,
Cold Spring Harbor, NY.

Antisense polynucleotides complementary to one or
more sequences are employed to inhibit transcription, RNA
processing, and/or translation of the cognate mRNA species and
thereby effect a reduction in the amount of the respective
encoded polypeptide. Such antisense polynucleotides can
provide a therapeutic function by inhibiting the formation of
one or more endogenous Ig chains in vivo.

Whether as soluble antisense oligonucleotides or as
antisense RNA transcribed from an antisense transgene, the
antisense polynucleotides of this invention are selected so as
to hybridize preferentially to endogenous Ig sequences under
physiological conditions in vivo. Most typically, the
selected antisense polynucleotides will not appreciably
hybridize to heterologous Ig sequences encoded by a heavy or
light chain transgene of the invention (i.e., the antisense
oligonucleotides will not inhibit transgene Ig expression by
more than about 25 to 35 percent).

Antiserum Suppression

Partial or complete suppression of endogenous Ig
chain expression can be produced by injecting mice with
antisera against one or more endogenous Ig chains (Weiss et
al. (1984) Proc. Natl. Acad. Sci. (U.S.A.) 81: 211, which is
incorporated herein by reference). Antisera are selected so
as to react specifically with one or more endogenous Ig chains
but to have minimal or no cross-reactivity with heterologous
Ig chains encoded by an Ig transgene of the invention. Thus,
administration of selected antisera according to a schedule as
typified by that of Weiss et al., op. cit., will suppress
endogenous Ig chain expression but permit expression of
heterologous Ig chain(s) encoded by a transgene of the present
invention.



Nucleic Acids

For nucleic acids, the term "substantial homology"
indicates that two nucleic acids, or designated sequences
thereof, when optimally aligned and compared, are identical,
with appropriate nucleotide insertions or deletions, in at
least about 80% of the nucleotides, usually at least about 90%
to 95%, and more preferably at least about 98 to 99.5% of the
nucleotides. Alternatively, substantial homology exists when
the segments will hybridize under selective hybridization
conditions, to the complement of the strand. The nucleic
acids may be present in whole cells, in a cell lysate, or in a
partially purified or substantially pure form. A nucleic acid
is "isolated" or "rendered substantially pure" when purified
away from other cellular components or other contaminants,
e.g., other cellular nucleic acids or proteins, by standard
techniques, including alkaline/SDS treatment, CsCl banding,
column chromatography, agarose gel electrophoresis and others
well known in the art. See F. Ausubel, et al., ed., Current
Protocols in Molecular Biology, Greene Publishing and
Wiley-Interscience, New York (1987).
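
The percent-identity criterion for "substantial homology" can be illustrated with a short sketch. This is one possible reading, not a definition from the disclosure: it assumes the two sequences have already been optimally aligned by some other tool, uses "-" as a gap symbol, and skips gapped positions when computing identity. The function names and tier labels are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned, non-gap positions at which two sequences match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # insertions/deletions are tolerated and not scored
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared

def homology_tier(identity):
    """Map a percent identity onto the thresholds quoted in the text."""
    if identity >= 98.0:
        return "more preferred (about 98% or higher)"
    if identity >= 90.0:
        return "usual (about 90% or higher)"
    if identity >= 80.0:
        return "substantial (about 80% or higher)"
    return "below the stated threshold"
```

A pair of ten-nucleotide sequences differing at a single position scores 90% and falls in the "usual" tier under this reading.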

The nucleic acid compositions of the present
invention, while often in a native sequence (except for
modified restriction sites and the like), from either cDNA,
genomic or mixtures thereof, may be mutated in accordance with
standard techniques to provide gene sequences. For coding
sequences, these mutations may affect amino acid sequence as
desired. In particular, DNA sequences substantially
homologous to or derived from native V, D, J, constant,
switch and other such sequences described herein are
contemplated (where "derived" indicates that a sequence is
identical or modified from another sequence).

A nucleic acid is "operably linked" when it is
placed into a functional relationship with another nucleic
acid sequence. For instance, a promoter or enhancer is
operably linked to a coding sequence if it affects the
transcription of the sequence. With respect to transcription
regulatory sequences, operably linked means that the DNA
sequences being linked are contiguous and, where necessary to
join two protein coding regions, contiguous and in reading
frame. For switch sequences, operably linked indicates that
the sequences are capable of effecting switch recombination.

Specific Preferred Embodiments 

A preferred embodiment of the invention is an animal
containing at least one, typically 2-10, and sometimes 25-50
or more copies of the transgene described in Example 12 (e.g.,
pHC1 or pHC2) bred with an animal containing a single copy of
a light chain transgene described in Examples 5, 6, 8, or 14,
and the offspring bred with the J_H deleted animal described in
Example 10. Animals are bred to homozygosity for each of
these three traits. Such animals have the following genotype:
a single copy (per haploid set of chromosomes) of a human
heavy chain unrearranged mini-locus (described in Example 12),
a single copy (per haploid set of chromosomes) of a rearranged
human κ light chain construct (described in Example 14), and a
deletion at each endogenous mouse heavy chain locus that
removes all of the functional J_H segments (described in
Example 10). Such animals are bred with mice that are
homozygous for the deletion of the J_H segments (Example 10)
to produce offspring that are homozygous for the J_H deletion
and hemizygous for the human heavy and light chain constructs.
The resultant animals are injected with antigens and used for
production of human monoclonal antibodies against these
antigens.

B cells isolated from such an animal are
monospecific with regard to the human heavy and light chains
because they contain only a single copy of each gene.
Furthermore, they will be monospecific with regard to human
or mouse heavy chains because both endogenous mouse heavy
chain gene copies are nonfunctional by virtue of the deletion
spanning the J_H region introduced as described in Examples 9
and 12. Furthermore, a substantial fraction of the B cells
will be monospecific with regard to the human or mouse light
chains because expression of the single copy of the rearranged
human κ light chain gene will allelically and isotypically
exclude the rearrangement of the endogenous mouse κ and λ
chain genes in a significant fraction of B-cells.

The transgenic mouse of the preferred embodiment
will exhibit immunoglobulin production with a significant
repertoire, ideally substantially similar to that of a native
mouse. Thus, for example, in embodiments where the endogenous
Ig genes have been inactivated, the total immunoglobulin
levels will range from about 0.1 to 10 mg/ml of serum,
preferably 0.5 to 5 mg/ml, ideally at least about 1.0 mg/ml.
When a transgene capable of effecting a switch to IgG from IgM
has been introduced into the transgenic mouse, the adult mouse
ratio of serum IgG to IgM is preferably about 10:1. Of
course, the IgG to IgM ratio will be much lower in the
immature mouse. In general, greater than about 10%,
preferably 40 to 80% of the spleen and lymph node B cells
express exclusively human IgG protein.

The repertoire will ideally approximate that shown
in a non-transgenic mouse, usually at least about 10% as high,
preferably 25 to 50% or more. Generally, at least about a
thousand different immunoglobulins (ideally IgG), preferably
10^4 to 10^6 or more, will be produced, depending primarily on
the number of different V, J and D regions introduced into the
mouse genome. These immunoglobulins will typically recognize
about one-half or more of highly antigenic proteins,
including, but not limited to: pigeon cytochrome C, chicken
lysozyme, pokeweed mitogen, bovine serum albumin, keyhole
limpet hemocyanin, influenza hemagglutinin, staphylococcus
protein A, sperm whale myoglobin, influenza neuraminidase, and
lambda repressor protein. Some of the immunoglobulins will
exhibit an affinity for preselected antigens of at least about
10^7 M^-1, preferably 10^8 M^-1 to 10^9 M^-1 or greater.
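
For readers more used to dissociation constants, the affinity figures above can be converted with a one-line calculation. The sketch assumes the affinities are association constants in units of M^-1 (the usual convention, and an assumption about the printed values); the function name is hypothetical.

```python
def dissociation_constant_nM(ka_per_molar):
    """Convert an association constant (M^-1) to a dissociation constant (nM)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return 1e9 / ka_per_molar  # Kd = 1 / Ka, expressed in nanomolar

for ka in (1e7, 1e8, 1e9):
    print(f"Ka = {ka:.0e} M^-1  ->  Kd = {dissociation_constant_nM(ka):g} nM")
```

On this reading, the minimum affinity corresponds to a dissociation constant of about 100 nM and the preferred range to roughly 10 nM down to 1 nM or lower.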

Thus, prior to rearrangement of a transgene
containing various heavy or light chain gene segments, such
gene segments may be readily identified, e.g. by hybridization
or DNA sequencing, as being from a species of organism other
than the transgenic animal.

Although the foregoing describes a preferred 
embodiment of the transgenic animal of the invention, other 
embodiments are defined by the disclosure herein and more 
particularly by the transgenes described in the Examples. 

Four categories of transgenic animal may be defined:

I. Transgenic animals containing an unrearranged heavy
and rearranged light immunoglobulin transgene.

II. Transgenic animals containing an unrearranged heavy
and unrearranged light immunoglobulin transgene.

III. Transgenic animals containing a rearranged heavy and an
unrearranged light immunoglobulin transgene, and

IV. Transgenic animals containing rearranged heavy and
rearranged light immunoglobulin transgenes.

Of these categories of transgenic animal, the
order of preference is as follows: II > I > III > IV
where the endogenous light chain genes (or at least the κ
gene) have been knocked out by homologous recombination (or
other method), and I > II > III > IV where the endogenous light
chain genes have not been knocked out and must be dominated by
allelic exclusion.

EXPERIMENTAL EXAMPLES 

METHODS AND MATERIALS

Transgenic mice are derived according to Hogan, et
al., "Manipulating the Mouse Embryo: A Laboratory Manual",
Cold Spring Harbor Laboratory, which is incorporated herein by
reference.

Embryonic stem cells are manipulated according to
published procedures (Teratocarcinomas and embryonic stem
cells: a practical approach, E.J. Robertson, ed., IRL Press,
Washington, D.C., 1987; Zijlstra et al., Nature 342:435-438
(1989); and Schwartzberg et al., Science 246:799-803 (1989),
each of which is incorporated herein by reference).

DNA cloning procedures are carried out according to
J. Sambrook, et al. in Molecular Cloning: A Laboratory
Manual, 2d ed., 1989, Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, N.Y., which is incorporated herein by
reference.

Oligonucleotides are synthesized on an Applied
Biosystems oligonucleotide synthesizer according to
specifications provided by the manufacturer.

Hybridoma cells and antibodies are manipulated
according to "Antibodies: A Laboratory Manual", Ed Harlow and
David Lane, Cold Spring Harbor Laboratory (1988), which is
incorporated herein by reference.

EXAMPLE 1

Genomic Heavy Chain Human Ig Transgene

This Example describes the cloning of a human genomic
heavy chain immunoglobulin transgene which is microinjected
into a murine zygote.

Nuclei are isolated from fresh human placental
tissue as described by Marzluff et al., "Transcription and
Translation: A Practical Approach", B.D. Hames and
S.J. Higgins, eds., pp. 89-129, IRL Press, Oxford (1985).
The isolated nuclei (or PBS washed human spermatocytes) are
embedded in a low melting point agarose matrix and lysed with
EDTA and proteinase K to expose high molecular weight DNA,
which is then digested in the agarose with the restriction
enzyme NotI as described by M. Finney in Current Protocols in
Molecular Biology (F. Ausubel, et al., eds. John Wiley & Sons,
Supp. 4, 1988, Section 2.5.1).

The NotI digested DNA is then fractionated by pulsed
field gel electrophoresis as described by Anand et al.,
Nucl. Acids Res. 17:3425-3433 (1989). Fractions enriched for
the NotI fragment are assayed by Southern hybridization to
detect one or more of the sequences encoded by this fragment.
Such sequences include the heavy chain D segments, J segments,
μ and γ1 constant regions together with representatives of all
6 V_H families (although this fragment is identified as a 670 kb
fragment from HeLa cells by Berman et al. (1988), supra, we
have found it to be an 830 kb fragment from human placental and
sperm DNA). Those fractions containing this NotI fragment
(see Fig. 4) are pooled and cloned into the NotI site of the
vector pYACNN in yeast cells. Plasmid pYACNN is prepared by
digestion of pYAC-4 Neo (Cook et al., Nucleic Acids Res. 16:
11817 (1988)) with EcoRI and ligation in the presence of the
oligonucleotide 5' - AAT TGC GGC CGC - 3'.
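
The design of the linker oligonucleotide can be checked with a few lines of code. The sketch below is an illustrative reading, not a statement of the authors' procedure: it assumes that two copies of the oligonucleotide anneal to each other through their last eight bases, leaving 5'-AATT overhangs compatible with EcoRI-cut ends. The variable names are hypothetical; the NotI recognition sequence (GCGGCCGC) and the EcoRI overhang (AATT) are standard.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

linker = "AATTGCGGCCGC"              # oligonucleotide quoted in the text
overhang, core = linker[:4], linker[4:]

# A palindromic core lets two copies of the same oligo pair with each other.
core_is_self_complementary = core == reverse_complement(core)
# The annealed duplex carries a NotI recognition site.
carries_noti_site = "GCGGCCGC" in linker
# The unpaired 5' ends match the overhang left by EcoRI digestion.
ecori_compatible = overhang == "AATT"
```

All three checks hold for the quoted sequence, which is consistent with the linker converting an EcoRI site into a NotI cloning site.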

YAC clones containing the heavy chain NotI fragment
are isolated as described by Brownstein et al., Science
244:1348-1351 (1989), and Green et al., Proc. Natl. Acad. Sci.
USA 87:1213-1217 (1990), which are incorporated herein by
reference. The cloned NotI insert is isolated from high
molecular weight yeast DNA by pulsed field gel electrophoresis
as described by M. Finney, op cit. The DNA is condensed by
the addition of 1 mM spermine and microinjected directly into
the nucleus of single cell embryos as previously described.

EXAMPLE 2

Genomic κ Light Chain Human Ig Transgene
Formed by In Vivo Homologous Recombination

A map of the human κ light chain has been described
in Lorenz et al., Nucl. Acids Res. 15:9667-9677 (1987), which
is incorporated herein by reference.

A 450 kb XhoI to NotI fragment that includes all of
Cκ, the 3' enhancer, all J segments, and at least five
different V segments is isolated and microinjected into the
nucleus of single cell embryos as described in Example 1.

EXAMPLE 3

Genomic κ Light Chain Human Ig Transgene
Formed by In Vivo Homologous Recombination

A 750 kb MluI to NotI fragment that includes all of
the above plus at least 20 more V segments is isolated as
described in Example 1 and digested with BssHII to produce a
fragment of about 400 kb.

The 450 kb XhoI to NotI fragment plus the
approximately 400 kb MluI to BssHII fragment have sequence
overlap defined by the BssHII and XhoI restriction sites.
Homologous recombination of these two fragments upon
microinjection of a mouse zygote results in a transgene
containing at least an additional 15-20 V segments over that
found in the 450 kb XhoI/NotI fragment (Example 2).
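
The approximate size of the shared region can be estimated from the fragment lengths given in Examples 2 and 3. This is a back-of-the-envelope sketch under two assumptions not spelled out in the text: that the 450 kb XhoI to NotI fragment lies entirely within the 750 kb MluI to NotI fragment and shares its NotI end, and that the quoted sizes are close to exact. The function name is hypothetical.

```python
def overlap_kb(parent_kb, left_kb, right_kb):
    """Overlap of a left-end and a right-end fragment of the same parent."""
    overlap = left_kb + right_kb - parent_kb
    if overlap <= 0:
        raise ValueError("fragments do not overlap")
    return overlap

parent = 750   # MluI to NotI fragment
left = 400     # MluI to BssHII fragment (approximate)
right = 450    # XhoI to NotI fragment

shared = overlap_kb(parent, left, right)
print(f"estimated overlap between XhoI and BssHII sites: {shared} kb")
```

Under these assumptions the two microinjected fragments would share roughly 100 kb of sequence for homologous recombination, and the recombined product would span the full 750 kb.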

EXAMPLE 4

Construction of Heavy Chain Mini-Locus

A. Construction of pGP1 and pGP2

pBR322 is digested with EcoRI and StyI and ligated
with the following oligonucleotides to generate pGP1 which
contains a 147 base pair insert containing the restriction
sites shown in Fig. 8. The general overlapping of these
oligos is also shown in Fig. 9.

The oligonucleotides are:

oligo-1  5' - CTT GAG CCC GCC TAA TGA GCG GGC TTT
              TTT TTG CAT ACT GCG GCC - 3'

oligo-2  5' - GCA ATG GCC TGG ATC CAT GGC GCG CTA
              GCA TCG ATA TCT AGA GCT CGA GCA - 3'

oligo-3  5' - TGC AGA TCT GAA TTC CCG GGT ACC AAG
              CTT ACG CGT ACT AGT GCG GCC GCT - 3'

oligo-4  5' - AAT TAG CGG CCG CAC TAG TAC GCG TAA
              GCT TGG TAC CCG GGA ATT - 3'

oligo-5  5' - CAG ATC TGC ATG CTC GAG CTC TAG ATA
              TCG ATG CTA GCG CGC CAT GGA TCC - 3'

oligo-6  5' - AGG CCA TTG CGG CCG CAG TAT GCA AAA
              AAA AGC CCG CTC ATT AGG CGG GCT - 3'



This plasmid contains a large polylinker flanked by
rare cutting NotI sites for building large inserts that can be
isolated from vector sequences for microinjection. The
plasmid is based on pBR322 which is relatively low copy
compared to the pUC based plasmids (pGP1 retains the pBR322
copy number control region near the origin of replication).
Low copy number reduces the potential toxicity of insert
sequences. In addition, pGP1 contains a strong transcription
terminator sequence derived from trpA (Christie et al., Proc.
Natl. Acad. Sci. USA 78:4180 (1981)) inserted between the
ampicillin resistance gene and the polylinker. This further
reduces the toxicity associated with certain inserts by
preventing readthrough transcription coming from the
ampicillin promoters.

Plasmid pGP2 is derived from pGP1 to introduce an
additional restriction site (SfiI) in the polylinker. pGP1 is
digested with MluI and SpeI to cut the recognition sequences
in the polylinker portion of the plasmid.

The following adapter oligonucleotides are ligated
to the thus digested pGP1 to form pGP2.



5' CGC GTG GCC GCA ATG GCC A 3'
5' CTA GTG GCC ATT GCG GCC A 3'

pGP2 is identical to pGP1 except that it contains an
additional SfiI site located between the MluI and SpeI sites.
This allows inserts to be completely excised with SfiI as well
as with NotI.
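
The two adapter oligonucleotides can be checked for the properties described above. The sketch below is illustrative only and the variable names are hypothetical. It assumes the first four bases of each adapter form the single-stranded overhang (CGCG matching an MluI-cut end, CTAG matching a SpeI-cut end) and uses the standard SfiI recognition pattern GGCCNNNNNGGCC.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

adapter_top = "CGCGTGGCCGCAATGGCCA"      # first adapter quoted in the text
adapter_bottom = "CTAGTGGCCATTGCGGCCA"   # second adapter quoted in the text

# After removing each 4-nt overhang, the remaining 15 nt should base-pair.
top_core, bottom_core = adapter_top[4:], adapter_bottom[4:]
adapters_anneal = top_core == reverse_complement(bottom_core)

# The annealed duplex should carry one SfiI site (GGCC, 5 x N, GGCC).
sfi_match = re.search(r"GGCC[ACGT]{5}GGCC", adapter_top)

mlui_end = adapter_top[:4] == "CGCG"     # overhang left by MluI
spei_end = adapter_bottom[:4] == "CTAG"  # overhang left by SpeI
```

For the sequences as printed, the cores are exact reverse complements and the top strand contains the site GGCCGCAATGGCC, consistent with the stated purpose of adding an SfiI site between the MluI and SpeI sites.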

B. Construction of pRE3 (rat enhancer 3')

An enhancer sequence located downstream of the rat
constant region is included in the heavy chain constructs.

The heavy chain region 3' enhancer described by
Petterson et al., Nature 344:165-168 (1990), which is
incorporated herein by reference, is isolated and cloned. The
rat IGH 3' enhancer sequence is PCR amplified by using the
following oligonucleotides:

5' CAG GAT CCA GAT ATC AGT ACC TGA AAC AGG GCT TGC 3'
5' GAG CAT GCA CAG GAC CTG GAG CAC ACA CAG CCT TCC 3'

The thus formed double stranded DNA encoding the 3'
enhancer is cut with BamHI and SphI and cloned into BamHI/SphI
cut pGP2 to yield pRE3 (rat enhancer 3').
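
The cloning sites built into the two PCR primers can be located with a simple search. This is a minimal sketch with hypothetical names; it only looks for the standard BamHI (GGATCC) and SphI (GCATGC) recognition sequences in the primer sequences quoted above.

```python
SITES = {"BamHI": "GGATCC", "SphI": "GCATGC"}

forward_primer = "CAGGATCCAGATATCAGTACCTGAAACAGGGCTTGC"
reverse_primer = "GAGCATGCACAGGACCTGGAGCACACACAGCCTTCC"

def sites_in(seq):
    """Map each enzyme whose site occurs in seq to its 0-based start index."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}

print("forward:", sites_in(forward_primer))
print("reverse:", sites_in(reverse_primer))
```

Each primer carries exactly one of the two sites near its 5' end, so the amplified product can be cut with BamHI and SphI and inserted in a defined orientation.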

C. Cloning of Human J-μ Region

A substantial portion of this region is cloned by
combining two or more fragments isolated from phage lambda
inserts. See Fig. 9.

A 6.3 kb BamHI/HindIII fragment that includes all
human J segments (Matsuda et al., EMBO J., 7:1047-1051 (1988);
Ravetch et al., Cell, 27:583-591 (1981), which are
incorporated herein by reference) is isolated from a human
genomic DNA library using the oligonucleotide GGA CTG TGT CCC
TGT GTG ATG CTT TTG ATG TCT GGG GCC AAG.

An adjacent 10 kb HindIII/BamHI fragment that
contains enhancer, switch and constant region coding exons
(Yasui et al., Eur. J. Immunol. 19:1399-1403 (1989)) is
similarly isolated using the oligonucleotide:

CAC CAA GTT GAC CTG CCT GGT CAC AGA CCT GAC CAC CTA TGA

An adjacent 3' 1.5 kb BamHI fragment is similarly
isolated using the insert of clone pMOM as probe. pMOM is a 4 kb
EcoRI/HindIII fragment isolated from a human genomic DNA library
with the following oligonucleotide (mu membrane exon 1) and
cloned into pUC19:

CCT GTG GAC CAC CGC CTC CAC CTT CAT
CGT CCT CTT CCT CCT

pGP1 is digested with BamHI and BglII followed by
treatment with calf intestinal alkaline phosphatase.

Fragments (a) and (b) from Fig. 9 are cloned in the
digested pGP1. A clone is then isolated which is oriented
such that the 5' BamHI site is destroyed by BamHI/BglII fusion. It
is identified as pMU (see Fig. 10). pMU is digested with
BamHI and fragment (c) from Fig. 9 is inserted. The
orientation is checked with HindIII digest. The resultant
plasmid pHIG1 (Fig. 10) contains an 18 kb insert encoding J
and Cμ segments.

D. Cloning of Cμ Region

pGP1 is digested with BamHI and HindIII followed
by treatment with calf intestinal alkaline phosphatase (Fig.
14). The so treated fragment (b) of Fig. 14 and fragment (c)
of Fig. 14 are cloned into the BamHI/HindIII cut pGP1. Proper
orientation of fragment (c) is checked by HindIII digestion to
form pCON1 containing a 12 kb insert encoding the Cμ region.

pHIG1 contains J segments, switch and μ sequences in its
18 kb insert, with an SfiI 3' site and a SpeI 5' site in a
polylinker flanked by NotI sites. pCON1, which will be used
for rearranged VDJ segments, is identical except that
it lacks the J region and contains only a 12 kb insert. The
use of pCON1 in the construction of a fragment containing
rearranged VDJ segments will be described hereinafter.

E. Cloning of γ-1 Constant Region (pREG2)

The cloning of the human γ-1 region is depicted in
Fig. 16.

Yamamura et al., Proc. Natl. Acad. Sci. USA
82:2152-2156 (1986) reported the expression of membrane bound
human γ-1 from a transgene construct that had been partially
deleted on integration. Their results indicate that the 3'
BamHI site delineates a sequence that includes the
transmembrane rearranged and switched copy of the gamma gene
with a V-C intron of less than 5 kb. Therefore, in the
unrearranged, unswitched gene, the entire switch region is
included in a sequence beginning less than 5 kb from the 5'
end of the first γ-1 constant exon. Therefore it is included
in the 5' 5.3 kb HindIII fragment (Ellison et al., Nucleic
Acids Res. 10:4071-4079 (1982), which is incorporated herein
by reference). Takahashi et al., Cell 29:671-679 (1982),
which is incorporated herein by reference, also reports that
this fragment contains the switch sequence, and this fragment
together with the 7.7 kb HindIII to BamHI fragment must
include all of the sequences we need for the transgene
construct. An intronic sequence is a nucleotide sequence of
at least 15 contiguous nucleotides that occurs in an intron of
a specified gene.

Phage clones containing the γ-1 region are
identified and isolated using the following oligonucleotide
which is specific for the third exon of γ-1 (CH3):

5' TGA GCC ACG AAG ACC CTG AGG
TCA AGT TCA ACT GGT ACG TGG 3'



A 7.7 kb HindIII to BglII fragment (fragment (a) in
Fig. 11) is cloned into HindIII/BglII cut pRE3 to form pREG1.
The upstream 5.3 kb HindIII fragment (fragment (b) in Fig. 11)
is cloned into HindIII digested pREG1 to form pREG2. Correct
orientation is confirmed by BamHI/SpeI digestion.



F. Combining Cγ and Cμ

The previously described plasmid pHIG1 contains
human J segments and the Cμ constant region exons. To provide
a transgene containing the Cμ constant region gene segments,
pHIG1 was digested with SfiI (Fig. 10). The plasmid pREG2 was
also digested with SfiI to produce a 13.5 kb insert containing
human Cγ exons and the rat 3' enhancer sequence. These
sequences were combined to produce the plasmid pHIG3' (Fig.
12) containing the human J segments, the human Cμ constant
region, the human Cγ1 constant region and the rat 3' enhancer
contained on a 31.5 kb insert.

A second plasmid encoding human Cμ and human Cγ1
without J segments is constructed by digesting pCON1 with SfiI
and combining that with the SfiI fragment containing the human
Cγ region and the rat 3' enhancer by digesting pREG2 with
SfiI. The resultant plasmid, pCON (Fig. 12), contains a 26 kb
NotI/SpeI insert containing human Cμ, human γ1 and the rat 3'
enhancer sequence.



G. Cloning of D Segment

The strategy for cloning the human D segments is
depicted in Fig. 13. Phage clones from the human genomic
library containing D segments are identified and isolated
using probes specific for diversity region sequences (Ichihara
et al., EMBO J. 7:4141-4150 (1988)). The following
oligonucleotides are used:

DXP1: 5' - TGG TAT TAC TAT GGT TCG GGG AGT TAT TAT
AAC CAC AGT GTC - 3'

DXP4: 5' - GCC TGA AAT GGA GCC TCA GGG CAC AGT GGG
CAC GGA CAC TGT - 3'

DN4: 5' - GCA GGG AGG ACA TGT TTA GGA TCT GAG GCC
GCA CCT GAC ACC - 3'

A 5.2 kb XhoI fragment (fragment (b) in Fig. 13)
containing DLR1, DXP1, DXP'1, and DA1 is isolated from a phage
clone identified with oligo DXP1.

A 3.2 kb XbaI fragment (fragment (c) in Fig. 13)
containing DXP4, DA4 and DK4 is isolated from a phage clone
identified with oligo DXP4.

Fragments (b), (c) and (d) from Fig. 13 are combined
and cloned into the XbaI/XhoI site of pGP1 to form pHIG2 which
contains a 10.6 kb insert.

This cloning is performed sequentially. First, the
5.2 kb fragment (b) in Fig. 13 and the 2.2 kb fragment (d) of
Fig. 13 are treated with calf intestinal alkaline phosphatase
and cloned into pGP1 digested with XhoI and XbaI. The
resultant clones are screened with the 5.2 and 2.2 kb inserts.
Half of those clones testing positive with the 5.2 and 2.2 kb
inserts have the 5.2 kb insert in the proper orientation as
determined by BamHI digestion. The 3.2 kb XbaI fragment from
Fig. 13 is then cloned into this intermediate plasmid
containing fragments (b) and (d) to form pHIG2. This plasmid
contains diversity segments cloned into the polylinker with a
unique 5' SfiI site and unique 3' SpeI site. The entire
polylinker is flanked by NotI sites.



H. Construction of Heavy Chain Minilocus

The following describes the construction of a human
heavy chain mini-locus which contains one or more V segments.

An unrearranged V segment corresponding to that
identified as the V segment contained in the hybridoma
of Newkirk et al., J. Clin. Invest. 81:1511-1518 (1988), which
is incorporated herein by reference, is isolated using the
following oligonucleotide:

5' - GAT CCT GGT TTA GTT AAA GAG GAT TTT
ATT CAC CCC TGT GTC - 3'

A restriction map of the unrearranged V segment is
determined to identify unique restriction sites which provide
upon digestion a DNA fragment having a length of approximately 2
kb containing the unrearranged V segment together with 5' and
3' flanking sequences. The 5' flanking sequences will include
promoter and other regulatory sequences whereas the 3'
flanking sequence provides recombination sequences necessary
for V-DJ joining. This approximately 3.0 kb V segment insert
is cloned into the polylinker of pGP2 to form pVH1.

pVH1 is digested with SfiI and the resultant
fragment is cloned into the SfiI site of pHIG2 to form
pHIG5'. Since pHIG2 contains D segments only, the resultant
pHIG5' plasmid contains a single V segment together with D
segments. The size of the insert contained in pHIG5' is 10.6
kb plus the size of the V segment insert.

The insert from pHIG5' is excised by digestion with
NotI and SpeI and isolated. pHIG3', which contains J, Cμ and
Cγ1 segments, is digested with SpeI and NotI and the 31 kb
fragment containing such sequences and the rat 3' enhancer
sequence is isolated. These two fragments are combined and
ligated into NotI digested pGP1 to produce pHIG which contains
an insert encoding a V segment, nine D segments, six functional J
segments, Cμ, Cγ and the rat 3' enhancer. The size of this
insert is approximately 43 kb plus the size of the V segment
insert.
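
The insert sizes quoted in sections C through H can be tallied to see how the final figure arises. The following is a minimal Python sketch using only sizes stated in the text; the variable and function names are hypothetical, and the 2.0 kb V segment size is the approximate value given above.

```python
# Fragment sizes in kb, as stated in the text.
D_FRAGMENTS_KB = {"b": 5.2, "c": 3.2, "d": 2.2}  # fragments of Fig. 13
PHIG1_INSERT_KB = 18.0   # J and C-mu segments
PREG2_INSERT_KB = 13.5   # C-gamma exons plus rat 3' enhancer

# pHIG2: the three D-segment fragments combined.
d_insert_kb = round(sum(D_FRAGMENTS_KB.values()), 1)

# pHIG3': the pHIG1 insert combined with the pREG2 SfiI insert.
phig3_insert_kb = round(PHIG1_INSERT_KB + PREG2_INSERT_KB, 1)


def phig_insert_kb(v_segment_kb: float) -> float:
    """Size of the pHIG insert: D segments + pHIG3' insert + V segment insert."""
    return round(d_insert_kb + phig3_insert_kb + v_segment_kb, 1)


print(d_insert_kb, phig3_insert_kb, phig_insert_kb(2.0))
```

The two intermediate sums reproduce the 10.6 kb and 31.5 kb figures given in the text, and together they come to roughly 42 kb before the V segment is added, in line with the approximate figure of 43 kb stated above.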

I. Construction of Heavy Chain Minilocus
by Homologous Recombination

As indicated in the previous section, the insert of
pHIG is approximately 43 to 45 kb when a single V segment is
employed. This insert size is at or near the limit of that
which may be readily cloned into plasmid vectors. In order to
provide for the use of a greater number of V segments, the
following describes in vivo homologous recombination of
overlapping DNA fragments which upon homologous recombination
within a zygote or ES cell form a transgene containing the rat
3' enhancer sequence, the human Cμ, the human Cγ1, human J
segments, human D segments and a multiplicity of human V
segments.

A 6.3 kb BamHI/HindIII fragment containing human J
segments (see fragment (a) in Fig. 9) is cloned into MluI/SpeI
digested pHIG5' using the following adapters:

5' GAT CCA AGC AGT 3'
5' CTA GAC TGC TTG 3'

5' CGC GTC GAA CTA 3'
5' AGC TTA GTT CGA 3'

The resultant plasmid is designated pHIG5'O
(overlap). The insert contained in this plasmid contains
human V, D and J segments. When the single V segment from
pVH1 is used, the size of this insert is approximately 17 kb
plus 2 kb. This insert is isolated and combined with the
insert from pHIG3' which contains the human J, Cμ, γ1 and rat
3' enhancer sequences. Both inserts contain human J segments
which provide for approximately 6.3 kb of overlap between the
two DNA fragments. When coinjected into the mouse zygote, in
vivo homologous recombination occurs generating a transgene
equivalent to the insert contained in pHIG.
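
The size of the transgene formed by recombination across the shared J region follows from counting the overlap once. The sketch below, in Python, is a simplified model and not a description of the biological mechanism; the function names are hypothetical and the toy strings merely stand in for DNA.

```python
def recombined_length_kb(left_kb: float, right_kb: float, overlap_kb: float) -> float:
    """Length of the product when two fragments recombine across a shared
    overlap, so that the overlapping region is present only once."""
    if overlap_kb > min(left_kb, right_kb):
        raise ValueError("overlap cannot be longer than either fragment")
    return round(left_kb + right_kb - overlap_kb, 1)


def join_by_overlap(left: str, right: str, overlap: str) -> str:
    """Toy string model: 'left' ends with the overlap and 'right' starts with it."""
    if not (left.endswith(overlap) and right.startswith(overlap)):
        raise ValueError("fragments do not share the stated overlap")
    return left + right[len(overlap):]


# Sizes from the text: pHIG5'O insert (17 kb plus a 2 kb V segment),
# pHIG3' insert (31.5 kb), and about 6.3 kb of shared J-region sequence.
product_kb = recombined_length_kb(17.0 + 2.0, 31.5, 6.3)
toy_product = join_by_overlap("VVDDJJ", "JJMMGG", "JJ")
print(product_kb, toy_product)
```

With these inputs the recombined product is about 44 kb, which falls within the 43 to 45 kb range given for the pHIG insert.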

This approach provides for the addition of a
multiplicity of V segments into the transgene formed in vivo.
For example, instead of incorporating a single V segment into
pHIG5', a multiplicity of V segments contained on (1) isolated
genomic DNA, (2) ligated DNA derived from genomic DNA, or (3)
DNA encoding a synthetic V segment repertoire is cloned into
pHIG2 at the SfiI site to generate pHIG5' VN. The J segments
fragment (a) of Fig. 9 is then cloned into pHIG5' VN and the
insert isolated. This insert now contains a multiplicity of V
segments and J segments which overlap with the J segments
contained on the insert isolated from pHIG3'. When
cointroduced into the nucleus of a mouse zygote, homologous
recombination occurs to generate in vivo the transgene
encoding multiple V segments and multiple J segments, multiple
D segments, the Cμ region, the Cγ1 region (all from human) and
the rat 3' enhancer sequence.

EXAMPLE 5

Construction of Light Chain Minilocus

A. Construction of pEμ1

The construction of pEμ1 is depicted in Fig. 16.
The mouse heavy chain enhancer is isolated on the XbaI to
EcoRI 678 bp fragment (Banerji et al., Cell 33:729-740 (1983))
from phage clones using oligo:

5' GAA TGG GAG TGA GGC TCT CTC ATA CCC
TAT TCA GAA CTG ACT 3'

This Eμ fragment is cloned into EcoRV/XbaI digested
pGP1 by blunt end filling in the EcoRI site. The resultant
plasmid is designated pEμ1.

B. Construction of κ Light Chain Minilocus

The κ construct contains at least one human Vκ
segment, all five human Jκ segments, the human J-Cκ enhancer,
human κ constant region exon, and, ideally, the human 3' κ
enhancer (Meyer et al., EMBO J. 8:1959-1964 (1989)). The κ
enhancer in mouse is 9 kb downstream from Cκ. However, it is
as yet unidentified in the human. In addition, the construct
contains a copy of the mouse heavy chain J-Cμ enhancers.

The minilocus is constructed from four component
fragments:

(a) A 16 kb SmaI fragment that contains the human Cκ
exon and the 3' human enhancer by analogy with the mouse
locus;

(b) A 5' adjacent 5 kb SmaI fragment, which
contains all five J segments;

(c) The mouse heavy chain intronic enhancer
isolated from pEμ1 (this sequence is included to induce
expression of the light chain construct as early as possible
in B-cell development. Because the heavy chain genes are
transcribed earlier than the light chain genes, this heavy
chain enhancer is presumably active at an earlier stage than
the intronic κ enhancer); and

(d) A fragment containing one or more V segments.
The preparation of this construct is as follows.
Human placental DNA is digested with SmaI and fractionated on
agarose gel by electrophoresis. Similarly, human placental
DNA is digested with BamHI and fractionated by
electrophoresis. The 16 kb fraction is isolated from the SmaI
digested gel and the 11 kb region is similarly isolated from
the gel containing DNA digested with BamHI.

The 16 kb SmaI fraction is cloned into Lambda FIX II
(Stratagene, La Jolla, California) which has been digested
with XhoI and treated with Klenow fragment DNA polymerase to fill
in the XhoI restriction digest product. Ligation of the 16 kb
SmaI fraction destroys the SmaI sites and leaves the XhoI sites
intact.
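
Why this ligation regenerates XhoI sites while destroying the SmaI sites can be seen by writing out the junction sequences. The following minimal Python sketch assumes the standard cleavage patterns (XhoI cuts C^TCGAG, leaving a 5' TCGA overhang; SmaI cuts CCC^GGG, leaving blunt ends); the variable names are illustrative only.

```python
XHOI = "CTCGAG"  # cut as C^TCGAG
SMAI = "CCCGGG"  # cut as CCC^GGG (blunt)

# Top-strand sequence at the ends of the vector arms after XhoI digestion
# and fill-in of the 5' TCGA overhangs.
left_arm_end = "CTCGA"     # ...C followed by the filled-in TCGA
right_arm_start = "TCGAG"  # filled-in TCGA followed by G...

# Top-strand sequence at the ends of a SmaI-generated fragment.
insert_start = "GGG"
insert_end = "CCC"

left_junction = left_arm_end + insert_start    # vector arm -> insert
right_junction = insert_end + right_arm_start  # insert -> vector arm

for junction in (left_junction, right_junction):
    print(junction, "XhoI" if XHOI in junction else "-", "SmaI" if SMAI in junction else "-")
```

Both junctions contain CTCGAG and neither contains CCCGGG, which is consistent with the insert later being recovered as an XhoI fragment.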

The 11 kb BamHI fraction is cloned into λ EMBL3
(Stratagene, La Jolla, California) which is digested with
BamHI prior to cloning.

Clones from each library were probed with the Cκ
specific oligo:

5' GAA CTG TGG CTG CAC CAT CTG TCT
TCA TCT TCC CGC CAT CTG 3'






A 16 kb XhoI insert was subcloned into the XhoI
cut pEμ1 so that Cκ is adjacent to the SmaI site. The
resultant plasmid was designated pKap1.

The above Cκ specific oligonucleotide is used to
probe the λ EMBL3/BamHI library to identify an 11 kb clone. A
5 kb SmaI fragment (fragment (b) in Fig. 20) is subcloned and
subsequently inserted into pKap1 digested with SmaI. Those
plasmids containing the correct orientation of J segments, Cκ
and the Eμ enhancer are designated pKap2.

One or more Vκ segments are thereafter subcloned
into the MluI site of pKap2 to yield the plasmid pKapH which
encodes the human Vκ segments, the human Jκ segments, the
human Cκ segments and the human Eμ enhancer. This insert is
excised by digesting pKapH with NotI and purified by agarose
gel electrophoresis. The thus purified insert is
microinjected into the pronucleus of a mouse zygote as
previously described.

C. Construction of κ Light Chain Minilocus by
In Vivo Homologous Recombination

The 11 kb BamHI fragment is cloned into BamHI
digested pGP1 such that the 3' end is toward the SfiI site.
The resultant plasmid is designated pKAPint. One or more Vκ
segments is inserted into the polylinker between the BamHI and
SpeI sites in pKAPint to form pKapHV. The insert of pKapHV is
excised by digestion with NotI and purified. The insert from
pKap2 is excised by digestion with NotI and purified. Each of
these fragments contains regions of homology in that the
fragment from pKapHV contains a 5 kb sequence of DNA that
includes the Jκ segments which is substantially homologous to
the 5 kb SmaI fragment contained in the insert obtained from
pKap2. As such, these inserts are capable of homologously
recombining when microinjected into a mouse zygote to form a
transgene encoding Vκ, Jκ and Cκ.

EXAMPLE 6 

Isolation of Genomic Clones
Corresponding to Rearranged and Expressed
Copies of Immunoglobulin κ Light Chain Genes

This example describes the cloning of immunoglobulin
κ light chain genes from cultured cells that express an
immunoglobulin of interest. Such cells may contain multiple
alleles of a given immunoglobulin gene. For example, a
hybridoma might contain four copies of the κ light chain gene,
two copies from the fusion partner cell line and two copies
from the original B-cell expressing the immunoglobulin of
interest. Of these four copies, only one encodes the
immunoglobulin of interest, despite the fact that several of
them may be rearranged. The procedure described in this
example allows for the selective cloning of the expressed copy
of the κ light chain.

A. Double Stranded cDNA 

Cells from a human hybridoma, or lymphoma, or other
cell line that synthesizes either cell surface or secreted or
both forms of IgM with a κ light chain are used for the
isolation of polyA+ RNA. The RNA is then used for the
synthesis of oligo dT primed cDNA using the enzyme reverse
transcriptase. The single stranded cDNA is then isolated and
G residues are added to the 3' end using the enzyme
polynucleotide terminal transferase. The G-tailed
single-stranded cDNA is then purified and used as template for
second strand synthesis (catalyzed by the enzyme DNA
polymerase) using the following oligonucleotide as a primer:

5' - GAG GTA CAC TGA CAT ACT GGC ATG CCC
CCC CCC CCC - 3'

The double stranded cDNA is isolated and used for
determining the nucleotide sequence of the 5' end of the mRNAs
encoding the heavy and light chains of the expressed
immunoglobulin molecule. Genomic clones of these expressed
genes are then isolated. The procedure for cloning the
expressed light chain gene is outlined in part B below.

B. Light Chain

The double stranded cDNA described in part A is
denatured and used as a template for a third round of DNA
synthesis using the following oligonucleotide primer:

5' - GTA CGC CAT ATC AGC TGG ATG AAG TCA TCA GAT
GGC GGG AAG ATG AAG ACA GAT GGT GCA - 3'

This primer contains sequences specific for the
constant portion of the κ light chain message (TCA TCA GAT GGC
GGG AAG ATG AAG ACA GAT GGT GCA) as well as unique sequences
that can be used as a primer for the PCR amplification of the
newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG
AAG). The sequence is amplified by PCR using the following
two oligonucleotide primers:

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'
5' - GTA CGC CAT ATC AGC TGG ATG AAG - 3'

The PCR amplified sequence is then purified by gel
electrophoresis and used as template for dideoxy sequencing
reactions using the following oligonucleotide as a primer:

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'

The first 42 nucleotides of sequence will then be
used to synthesize a unique probe for isolating the gene from
which the immunoglobulin message was transcribed. This synthetic
42 nucleotide segment of DNA will be referred to below as
o-kappa.
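
The layout of the composite primers used in parts A and B can be made explicit by splitting each into its tag and its template-specific portion. The following is a minimal Python sketch; the function names (revcomp, split_primer) are hypothetical, and the Cκ probe is the Cκ specific oligonucleotide quoted in Example 5.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def split_primer(primer: str, tag: str) -> tuple:
    """Split a composite primer into its 5' tag and its 3' specific portion."""
    if not primer.startswith(tag):
        raise ValueError("primer does not begin with the expected tag")
    return tag, primer[len(tag):]


# Oligonucleotides quoted in the text (5' -> 3'), spaces removed.
ANCHOR_PRIMER = "GAGGTACACTGACATACTGGCATG"
SECOND_STRAND_PRIMER = "GAGGTACACTGACATACTGGCATGCCCCCCCCCCCC"
TAG_PRIMER = "GTACGCCATATCAGCTGGATGAAG"
THIRD_ROUND_PRIMER = (
    "GTACGCCATATCAGCTGGATGAAG" "TCATCAGATGGCGGGAAGATGAAGACAGATGGTGCA"
)
CK_PROBE = "GAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTG"

# Second-strand primer = anchor sequence + a C run that pairs with the G tail.
_, c_run = split_primer(SECOND_STRAND_PRIMER, ANCHOR_PRIMER)

# Third-round primer = unique tag + sequence specific for the kappa constant region.
_, ck_specific = split_primer(THIRD_ROUND_PRIMER, TAG_PRIMER)

# The specific portion is antisense; its reverse complement reads in the
# same sense as the C-kappa probe and overlaps the probe's 3' end.
ck_sense = revcomp(ck_specific)
overlaps_probe = CK_PROBE.endswith(ck_sense[:32])
print(len(c_run), len(ck_specific), overlaps_probe)
```

Read this way, the two PCR primers are simply the 24-nucleotide 5' portions of the second-strand and third-round primers, so only molecules that were copied with both composite primers are amplified.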

A Southern blot of DNA, isolated from the Ig
expressing cell line and digested individually and in pairwise
combinations with several different restriction endonucleases
including SmaI, is then probed with the 32-P labelled unique
oligonucleotide o-kappa. A unique restriction endonuclease
site is identified upstream of the rearranged V segment.

DNA from the Ig expressing cell line is then cut
with SmaI and a second enzyme (or BamHI or KpnI if there is a SmaI
site inside the V segment). Any resulting non-blunted ends are
treated with the enzyme T4 DNA polymerase to give blunt ended
DNA molecules. Restriction site encoding linkers
(BamHI, EcoRI or XhoI, depending on what site does not exist in
the fragment) are then added and cut with the corresponding
linker enzyme to give DNA fragments with BamHI, EcoRI or XhoI
ends. The DNA is then
size fractionated by agarose gel electrophoresis, and the
fraction including the DNA fragment covering the expressed V
segment is cloned into lambda EMBL3 or Lambda FIX (Stratagene,
La Jolla, California). V segment containing clones are
isolated using the unique probe o-kappa. DNA is isolated from
positive clones and subcloned into the polylinker of pKap1.
The resulting clone is called pRKL.

EXAMPLE 7

Isolation of Genomic Clones
Corresponding to Rearranged Expressed Copies
of Immunoglobulin Heavy Chain μ Genes

This example describes the cloning of immunoglobulin
heavy chain μ genes from cultured cells that express an
immunoglobulin of interest. The procedure described in this
example allows for the selective cloning of the expressed copy
of a μ heavy chain gene.

Double-stranded cDNA is prepared and isolated as
described hereinbefore. The double-stranded cDNA is
denatured and used as a template for a third round of DNA
synthesis using the following oligonucleotide primer:

5' - GTA CGC CAT ATC AGC TGG ATG AAG ACA GGA GAC
GAG GGG GAA AAG GGT TGG GGC GGA TGC - 3'



This primer contains sequences specific for the
constant portion of the μ heavy chain message (ACA GGA GAC GAG
GGG GAA AAG GGT TGG GGC GGA TGC) as well as unique sequences
that can be used as a primer for the PCR amplification of the
newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG
AAG). The sequence is amplified by PCR using the following
two oligonucleotide primers:

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'
5' - GTA CTC CAT ATC AGC TGG ATG AAG - 3'

The PCR amplified sequence is then purified by gel 
electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'

The first 42 nucleotides of sequence are then used 
to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 
42 nucleotide segment of DNA will be referred to below as 
o-mu. 

A Southern blot of DNA, isolated from the Ig
expressing cell line and digested individually and in pairwise
combinations with several different restriction endonucleases
including MluI (MluI is a rare cutting enzyme that cleaves
between the J segment and mu CH1), is then probed with the
32-P labelled unique oligonucleotide o-mu. A unique
restriction endonuclease site is identified upstream of the
rearranged V segment.

DNA from the Ig expressing cell line is then cut
with MluI and a second enzyme. MluI or SpeI adapter linkers are
then ligated onto the ends and cut to convert the upstream
site to MluI or SpeI. The DNA is then size fractionated by
agarose gel electrophoresis, and the fraction including the
DNA fragment covering the expressed V segment is cloned
directly into the plasmid pGP1. V segment containing clones
are isolated using the unique probe o-mu, and the insert is
subcloned into MluI or MluI/SpeI cut plasmid pCON2. The
resulting plasmid is called pRMGH.

EXAMPLE 8

Construction of Human κ Miniloci Transgenes

Light Chain Minilocus

A human genomic DNA phage library was screened with
kappa light chain specific oligonucleotide probes and
clones spanning the Jκ-Cκ region were isolated. A 5.7 kb ClaI/XhoI
fragment containing Jκ1 together with a 13 kb XhoI fragment
containing Jκ2-5 and Cκ was cloned into pGP1d and used to create
the plasmid pKcor. This plasmid contains Jκ1-5, the kappa
intronic enhancer and Cκ together with 4.5 kb of 5' and 9 kb
of 3' flanking sequences. It also has a unique 5' XhoI site
for cloning Vκ segments and a unique 3' SalI site for
inserting additional cis-acting regulatory sequences.

V kappa genes

A human genomic DNA phage library was screened with
Vκ light chain specific oligonucleotide probes and
clones containing human Vκ segments were isolated. Functional V
segments were identified by DNA sequence analysis. These clones
contain TATA boxes, open reading frames encoding leader and
variable peptides (including 2 cysteine residues), splice
sequences, and recombination heptamer-12 bp spacer-nonamer
sequences. Three of the clones were mapped and sequenced.
Two of the clones, 65.5 and 65.8, appear to be functional: they
contain TATA boxes, open reading frames encoding leader and
variable peptides (including 2 cysteine residues), splice
sequences, and recombination heptamer-12 bp spacer-nonamer
sequences. The third clone, 65.4, appears to encode a VκI
pseudogene as it contains a non-canonical recombination
heptamer.
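
One of the functional criteria listed above, the heptamer-12 bp spacer-nonamer recombination signal, lends itself to a simple pattern check. The following Python sketch is a deliberate simplification: it assumes the consensus heptamer CACAGTG and consensus nonamer ACAAAAACC, requires an exact nonamer match, and uses invented toy flanking sequences rather than the sequences of clones 65.4, 65.5 or 65.8. The function name classify_rss is hypothetical.

```python
import re

CANONICAL_HEPTAMER = "CACAGTG"

# Any 7-mer, a 12 bp spacer, then the consensus nonamer.
RSS_12 = re.compile(
    r"(?P<heptamer>[ACGT]{7})(?P<spacer>[ACGT]{12})(?P<nonamer>ACAAAAACC)"
)


def classify_rss(flank: str) -> str:
    """Classify the recombination signal found in the 3' flank of a V segment."""
    match = RSS_12.search(flank)
    if match is None:
        return "no heptamer-12-nonamer signal found"
    if match.group("heptamer") == CANONICAL_HEPTAMER:
        return "canonical"
    return "non-canonical heptamer"


# Hypothetical toy flanks (not taken from the clones described in the text).
functional_like = "CACAGTG" + "ATTCAGCCTGGA" + "ACAAAAACC"
pseudogene_like = "CATAGTG" + "ATTCAGCCTGGA" + "ACAAAAACC"

print(classify_rss(functional_like))
print(classify_rss(pseudogene_like))
```

A real analysis would tolerate the natural variation seen in nonamers and would also check the other listed features (TATA box, open reading frame, cysteine residues, splice sequences).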

One of the functional clones, Vκ 65.8, which encodes
a VκIII family gene, was used to build a light chain minilocus
construct.

pKC1

The kappa light chain minilocus transgene pKC1 (Fig.
32) was generated by inserting a 7.5 kb XhoI/SalI fragment
containing Vκ 65.8 into the 5' XhoI site of pKcor. The
transgene insert was isolated by digestion with NotI prior to
injection.

The purified insert was microinjected into the
pronuclei of fertilized (C57BL/6 x CBA)F2 mouse embryos and
the surviving embryos were transferred into pseudopregnant females
as described by Hogan et al. (in Methods of Manipulating the
Mouse Embryo, 1986, Cold Spring Harbor Laboratory, New York).
Mice that developed from injected embryos were analyzed for
the presence of transgene sequences by Southern blot analysis
of tail DNA. Transgene copy number was estimated by band
intensity relative to control standards containing known
quantities of cloned DNA. Serum was isolated from these
animals and assayed for the presence of transgene encoded
human Ig kappa protein by ELISA as described by Harlow and
Lane (in Antibodies: A Laboratory Manual, 1988, Cold Spring
Harbor Laboratory, New York). Microtiter plate wells were
coated with mouse monoclonal antibodies specific for human Ig
kappa (clone 6E1, #0173, AMAC, Inc., Westbrook, ME), human IgM
(clone AF6, #0285, AMAC, Inc., Westbrook, ME) and human IgG1
(clone JL512, #0280, AMAC, Inc., Westbrook, ME). Serum
samples were serially diluted into the wells and the presence
of specific immunoglobulins detected with affinity isolated
alkaline phosphatase conjugated goat anti-human Ig
(polyvalent) that had been pre-adsorbed to minimize cross-
reactivity with mouse immunoglobulins.

Fig. 35 shows the results of an ELISA assay of serum
from 8 mice (I.D. #676, 674, 673, 670, 666, 665, 664, and
496). The first seven of these mice developed from embryos
that were injected with the pKC1 transgene insert and the
eighth mouse is derived from a mouse generated by
microinjection of the pHC1 transgene (described previously).
Two of the seven mice from KC1 injected embryos (I.D.#'s 666
and 664) did not contain the transgene insert as assayed by
DNA Southern blot analysis, and five of the mice (I.D.#'s 676,
674, 673, 670, and 665) contained the transgene. All but one
of the KC1 transgene positive animals express detectable
levels of human Ig kappa protein, and the single non-
expressing animal appears to be a genetic mosaic on the basis
of DNA Southern blot analysis. The pHC1 positive transgenic
mouse expresses human IgM and IgG1 but not Ig kappa,
demonstrating the specificity of the reagents used in the
assay.

pKC2

The kappa light chain minilocus transgene pKC2 was
generated by inserting an 8 kb XhoI/SalI fragment containing
Vκ 65.5 into the 5' XhoI site of pKC1. The resulting
transgene insert, which contains two Vκ segments, was isolated
prior to microinjection by digestion with NotI.




pKVe2

This construct is identical to pKC1 except that it
includes 1.2 kb of additional sequence 5' of Jκ and is missing
4.5 kb of sequence 3' of Vκ 65.8. In addition it contains a
0.9 kb XbaI fragment containing the mouse heavy chain J-μ
intronic enhancer (Banerji et al., Cell 33:729-740 (1983))
together with a 1.4 kb MluI/HindIII fragment containing the
human heavy chain J-μ intronic enhancer (Hayday et al., Nature
307:334-340 (1984)) inserted downstream. This construct tests
the feasibility of initiating early rearrangement of the light
chain minilocus to effect allelic and isotypic exclusion.
Analogous constructs can be generated with different
enhancers, i.e., the mouse or rat 3' kappa or heavy chain
enhancer (Meyer and Neuberger, EMBO J. 8:1959-1964 (1989);
Pettersson et al., Nature 344:165-168 (1990), which are
incorporated herein by reference).

Rearranged Light Chain Transgenes

A kappa light chain expression cassette was designed
to reconstruct functionally rearranged light chain genes that
have been amplified by PCR from human B-cell DNA. The scheme
is outlined in Fig. 33. PCR amplified light chain genes are
cloned into the vector pK5nx that includes 3.7 kb of 5'
flanking sequences isolated from the kappa light chain gene
65.5. The VJ segment fused to the 5' transcriptional
sequences is then cloned into the unique XhoI site of the
vector pK31s that includes Jκ2-4, the Jκ intronic enhancer, Cκ,
and 9 kb of downstream sequences. The resulting plasmid
contains a reconstructed functionally rearranged kappa light
chain transgene that can be excised with NotI for
microinjection into embryos. The plasmids also contain unique
SalI sites at the 3' end for the insertion of additional cis-
acting regulatory sequences.

Two synthetic oligonucleotides (o-130, o-131) were used to
amplify rearranged kappa light chain genes from human spleen
genomic DNA. Oligonucleotide o-131 (gga ccc aga (g,c)gg aac cat
gga a(g,a)(g,a,t,c)) is complementary to the 5' region of VκIII
family light chain genes and overlaps the first ATG of the
leader sequence. Oligonucleotide o-130 (gtg caa tca att ctc gag
ttt gac tac aga c) is complementary to a sequence approximately
150 bp 3' of Jκ1 and includes an XhoI site. These two
oligonucleotides amplify a 0.7 kb DNA fragment from human
spleen DNA corresponding to rearranged VκIII genes joined to
Jκ1 segments. The PCR amplified DNA was digested with NcoI and
XhoI, and individual PCR products were cloned into the plasmid
pNN03. The DNA sequences of 5 clones were determined, and two
clones with functional VJ joints (open reading frames) were
identified. Additional functionally rearranged light chain
clones are collected. The functionally rearranged clones can be
individually cloned into the light chain expression cassette
described above (Fig. 33). Transgenic mice generated with the
rearranged light chain constructs can be bred with heavy chain
minilocus transgenics to produce a strain of mice that express
a spectrum of fully human antibodies in which all of the
diversity of the primary repertoire is contributed by the heavy
chain. One source of light chain diversity can be somatic
mutation. Because not all light chains will be equivalent with
respect to their ability to combine with a variety of different
heavy chains, different strains of mice, each containing
different light chain constructs, can be generated and tested.
The advantage of this scheme, as opposed to the use of
unrearranged light chain miniloci, is the increased light chain
allelic and isotypic exclusion that comes from having the light
chain ready to pair with a heavy chain as soon as heavy chain
VDJ joining occurs. This combination can result in an increased
frequency of B-cells expressing fully human antibodies, and
thus it can facilitate the isolation of human Ig expressing
hybridomas.
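The degenerate positions in o-131 mean that the primer is a mixture of related sequences rather than a single molecule. The following is a minimal sketch, not part of the original procedure, that enumerates the members of that mixture from the notation used above; the function name `expand_degenerate` is hypothetical and chosen for illustration only.

```python
from itertools import product

# o-131 exactly as written in the text; each parenthesised group
# lists the alternative bases allowed at one position.
O131 = "gga ccc aga (g,c)gg aac cat gga a(g,a)(g,a,t,c)"


def expand_degenerate(spec: str) -> list[str]:
    """Return every concrete sequence encoded by a degenerate primer spec."""
    spec = spec.replace(" ", "")
    choices = []  # one tuple of alternative bases per primer position
    i = 0
    while i < len(spec):
        if spec[i] == "(":
            close = spec.index(")", i)
            choices.append(tuple(spec[i + 1:close].split(",")))
            i = close + 1
        else:
            choices.append((spec[i],))
            i += 1
    return ["".join(bases) for bases in product(*choices)]


variants = expand_degenerate(O131)
print(len(variants))          # 2 * 2 * 4 = 16 distinct primers
print(len(variants[0]))       # each primer is 24 nt long
# Every variant carries the NcoI site (ccatgg) used in the cloning step.
print(all("ccatgg" in v for v in variants))
```

All sixteen variants retain the ccatgg motif, which is consistent with the NcoI digestion used to clone the amplified products.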

NotI inserts of plasmids pIGM1, pHC1, pIGG1, pKC1,
and pKC2 were isolated away from vector sequences by agarose
gel electrophoresis. The purified inserts were microinjected
into the pronuclei of fertilized (C57BL/6 x CBA)F2 mouse
embryos, and the surviving embryos were transferred into
pseudopregnant females as described by Hogan et al. (Hogan et
al., Methods of Manipulating the Mouse Embryo, Cold Spring
Harbor Laboratory, New York (1986)).

EXAMPLE 9 

Inactivation of the Mouse Kappa Light Chain Gene by Homologous
Recombination

This example describes the inactivation of the mouse
endogenous kappa locus by homologous recombination in
embryonic stem (ES) cells followed by introduction of the
mutated gene into the mouse germ line by injection of targeted
ES cells bearing an inactivated kappa allele into early mouse
embryos (blastocysts).

The strategy is to delete Jκ and Cκ by homologous
recombination with a vector containing DNA sequences
homologous to the mouse kappa locus in which a 4.5 kb segment
of the locus, spanning the Jκ gene and Cκ segments, is deleted
and replaced by the selectable marker neo.

Construction of the kappa targeting vector

The plasmid pGEM7 (KJ1) contains the neomycin
resistance gene (neo), used for drug selection of transfected
ES cells, under the transcriptional control of the mouse
phosphoglycerate kinase (pgk) promoter (XbaI/TaqI fragment;
Adra et al., Gene 60:65-74 (1987)) in the cloning vector
pGEM-7Zf(+). The plasmid also includes a heterologous
polyadenylation site for the neo gene, derived from the 3'
region of the mouse pgk gene (PvuII/HindIII fragment; Boer et
al., Biochemical Genetics, 28:299-308 (1990)). This plasmid
was used as the starting point for construction of the kappa
targeting vector. The first step was to insert sequences
homologous to the kappa locus 3' of the neo expression
cassette.

Mouse kappa chain sequences (Fig. 20a) were isolated
from a genomic phage library derived from liver DNA using
oligonucleotide probes specific for the Cκ locus:

5'- GGC TGA TGC TGC ACC AAC TGT ATC CAT CTT CCC ACC ATC CAG -3'

and for the Jκ5 gene segment:

5'- CTC ACG TTC GGT GCT GGG ACC AAG CTG GAG CTG AAA CGT AAG -3'.

An 8 kb BglII/SacI fragment extending 3' of the
mouse Cκ segment was isolated from a positive phage clone in
two pieces, as a 1.2 kb BglII/SacI fragment and a 6.8 kb SacI
fragment, and subcloned into BglII/SacI digested pGEM7 (KJ1)
to generate the plasmid pNEO-K3' (Fig. 20b).

A 1.2 kb EcoRI/SphI fragment extending 5' of the Jκ
region was also isolated from a positive phage clone. An
SphI/XbaI/BglII/EcoRI adaptor was ligated to the SphI site of
this fragment, and the resulting EcoRI fragment was ligated
into EcoRI digested pNEO-K3', in the same 5' to 3' orientation
as the neo gene and the downstream 3' kappa sequences, to
generate pNEO-K5'3' (Fig. 20c).

The Herpes Simplex Virus (HSV) thymidine kinase (TK)
gene was then included in the construct in order to allow for
enrichment of ES clones bearing homologous recombinants, as
described by Mansour et al., Nature 336:348-352 (1988), which
is incorporated herein by reference. The HSV TK cassette was
obtained from the plasmid pGEM7 (TK), which contains the
structural sequences for the HSV TK gene bracketed by the
mouse pgk promoter and polyadenylation sequences as described
above for pGEM7 (KJ1). The EcoRI site of pGEM7 (TK) was
modified to a BamHI site and the TK cassette was then excised
as a BamHI/HindIII fragment and subcloned into pGP1b to
generate pGP1b-TK. This plasmid was linearized at the XhoI
site and the XhoI fragment from pNEO-K5'3', containing the neo
gene flanked by genomic sequences from 5' of Jκ and 3' of Cκ,
was inserted into pGP1b-TK to generate the targeting vector
J/C K1 (Fig. 20d). The putative structure of the genomic
kappa locus following homologous recombination with J/C K1 is
shown in Fig. 20e.



Generation and analysis of ES cells with targeted inactivation
of a kappa allele

The ES cells used were the AB-1 line grown on
mitotically inactive SNL76/7 cell feeder layers (McMahon and
Bradley, Cell 62:1073-1085 (1990)) essentially as described
(Robertson, E.J. (1987) in Teratocarcinomas and Embryonic Stem
Cells: A Practical Approach, E.J. Robertson, ed. (Oxford: IRL
Press), p. 71-112). Other suitable ES lines include, but are
not limited to, the E14 line (Hooper et al. (1987) Nature 326:
292-295), the D3 line (Doetschman et al. (1985) J. Embryol.
Exp. Morph. 87: 27-45), and the CCE line (Robertson et al.
(1986) Nature 323:445-448). The success of generating a
mouse line from ES cells bearing a specific targeted mutation
depends on the pluripotence of the ES cells (i.e., their
ability, once injected into a host blastocyst, to participate
in embryogenesis and contribute to the germ cells of the
resulting animal).

The pluripotence of any given ES cell line can vary
with time in culture and the care with which it has been
handled. The only definitive assay for pluripotence is to
determine whether the specific population of ES cells to be
used for targeting can give rise to chimeras capable of
germline transmission of the ES genome. For this reason,
prior to gene targeting, a portion of the parental population
of AB-1 cells is injected into C57BL/6J blastocysts to
ascertain whether the cells are capable of generating chimeric
mice with extensive ES cell contribution and whether the
majority of these chimeras can transmit the ES genome to
progeny.

The kappa chain inactivation vector J/C K1 was
digested with NotI and electroporated into AB-1 cells by the
methods described (Hasty et al., Nature, 350:243-246 (1991)).
Electroporated cells were plated onto 100 mm dishes at a
density of 1-2 x 10^6 cells/dish. After 24 hours, G418
(200 μg/ml of active component) and FIAU (0.5 μM) were added to
the medium, and drug-resistant clones were allowed to develop
over 10-11 days. Clones were picked, trypsinized, divided
into two portions, and further expanded. Half of the cells
derived from each clone were then frozen and the other half
analyzed for homologous recombination between vector and
target sequences.

DNA analysis was carried out by Southern blot
hybridization. DNA was isolated from the clones as described
(Laird et al., Nucleic Acids Res. 19:4293 (1991)), digested with
XbaI and probed with the 800 bp EcoRI/XbaI fragment indicated
in Fig. 20e as probe A. This probe detects a 3.7 kb XbaI
fragment in the wild type locus, and a diagnostic 1.8 kb band
in a locus which has homologously recombined with the
targeting vector (see Fig. 20a and e). Of 901 G418 and FIAU
resistant clones screened by Southern blot analysis, 7
displayed the 1.8 kb XbaI band indicative of a homologous
recombination into one of the kappa genes. These 7 clones
were further digested with the enzymes BglII, SacI, and PstI
to verify that the vector integrated homologously into one of
the kappa genes. When probed with the diagnostic 800 bp
EcoRI/XbaI fragment (probe A), BglII, SacI, and PstI digests
of wild type DNA produce fragments of 4.1, 5.4, and 7 kb,
respectively, whereas the presence of a targeted kappa allele
would be indicated by fragments of 2.4, ' 3, and 5.7 kb,
respectively (see Fig. 20a and e). All positive clones
detected by the XbaI digest showed the expected BglII, SacI,
and PstI restriction fragments diagnostic of a homologous
recombination at the kappa light chain. In addition, Southern
blot analysis of an NsiI digest of the targeted clones using a
neo specific probe (probe B, Fig. 20e) generated only the
predicted fragment of 4.2 kb, demonstrating that the clones
each contained only a single copy of the targeting vector.
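The probe A band sizes quoted above amount to a small lookup table for scoring ES clones. The sketch below is illustrative only: the function name `call_clone` and the 0.2 kb matching tolerance are assumptions, not values from the text, and the SacI digest is left out because its targeted fragment size is not legible in the source.

```python
# Probe A band sizes (kb) quoted in the text for each digest.
EXPECTED_KB = {
    "XbaI": {"wild_type": 3.7, "targeted": 1.8},
    "BglII": {"wild_type": 4.1, "targeted": 2.4},
    "PstI": {"wild_type": 7.0, "targeted": 5.7},
}


def call_clone(enzyme: str, observed_kb: list[float], tolerance: float = 0.2) -> str:
    """Classify a clone from the probe A bands seen in a single digest.

    `tolerance` (kb) is a hypothetical allowance for gel sizing error.
    """
    expected = EXPECTED_KB[enzyme]
    present = {
        allele: any(abs(band - size) <= tolerance for band in observed_kb)
        for allele, size in expected.items()
    }
    if present["wild_type"] and present["targeted"]:
        return "heterozygous targeted"
    if present["targeted"]:
        return "targeted only"
    if present["wild_type"]:
        return "wild type"
    return "uninterpretable"


# A correctly targeted ES clone keeps one wild-type allele, so both bands appear.
print(call_clone("XbaI", [3.7, 1.8]))
print(call_clone("BglII", [4.1]))
```

Because only one kappa allele is targeted in an ES clone, a positive clone is expected to show both the wild-type and the diagnostic band in every digest.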

Generation of mice bearing the inactivated kappa chain

Five of the targeted ES clones described in the
previous section were thawed and injected into C57BL/6J
blastocysts as described (Bradley, A. (1987) in
Teratocarcinomas and Embryonic Stem Cells: A Practical
Approach, E.J. Robertson, ed. (Oxford: IRL Press), p. 113-151)
and transferred into the uteri of pseudopregnant females to
generate chimeric mice resulting from a mixture of cells
derived from the input ES cells and the host blastocyst. The
extent of ES cell contribution to the chimeras can be visually
estimated by the amount of agouti coat coloration, derived
from the ES cell line, on the black C57BL/6J background.
Approximately half of the offspring resulting from blastocyst
injection of the targeted clones were chimeric (i.e., showed
agouti as well as black pigmentation) and of these, the
majority showed extensive (70 percent or greater) ES cell
contribution to coat pigmentation. The AB-1 ES cells are an XY
cell line and a majority of these high percentage chimeras
were male due to sex conversion of female embryos colonized by
male ES cells. Male chimeras derived from 4 of the 5 targeted
clones were bred with C57BL/6J females and the offspring
monitored for the presence of the dominant agouti coat color
indicative of germline transmission of the ES genome.
Chimeras from two of these clones consistently generated
agouti offspring. Since only one copy of the kappa locus was
targeted in the injected ES clones, each agouti pup had a 50
percent chance of inheriting the mutated locus. Screening for
the targeted gene was carried out by Southern blot analysis of
BglII-digested DNA from tail biopsies, using the probe
utilized in identifying targeted ES clones (probe A, Fig.
20e). As expected, approximately 50 percent of the agouti
offspring showed a hybridizing BglII band of 2.4 kb in
addition to the wild-type band of 4.1 kb, demonstrating the
germline transmission of the targeted kappa locus.

In order to generate mice homozygous for the
mutation, heterozygotes were bred together and the kappa
genotype of the offspring determined as described above. As
expected, three genotypes were derived from the heterozygote
matings: wild-type mice bearing two copies of a normal kappa
locus, heterozygotes carrying one targeted copy of the kappa
gene and one wild-type kappa gene, and mice homozygous for the
kappa mutation. The deletion of kappa sequences from these
latter mice was verified by hybridization of the Southern blots
with a probe specific for Jκ (probe C, Fig. 20a). Whereas
hybridization of the Jκ probe was observed to DNA samples from
heterozygous and wild-type siblings, no hybridizing signal was
present in the homozygotes, attesting to the generation of a
novel mouse strain in which both copies of the kappa locus
have been inactivated by deletion as a result of targeted
mutation.
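The 50 percent inheritance figure and the three genotypes obtained from heterozygote matings both follow from simple single-locus Mendelian segregation. A minimal sketch of that arithmetic is given below; the allele symbols "+" (wild type) and "-" (targeted) and the function name `cross` are notational assumptions introduced here for illustration.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent1: str, parent2: str) -> dict[str, Fraction]:
    """Expected offspring genotype fractions for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "+-".
    """
    counts = Counter(
        "".join(sorted(pair)) for pair in product(parent1, parent2)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Germline transmission: ES-derived gametes (+/-) crossed to wild type (+/+).
print(cross("+-", "++"))   # half of the agouti pups carry the targeted allele
# Heterozygote intercross: three genotypes in a 1:2:1 ratio.
print(cross("+-", "+-"))
```

Under these assumptions one quarter of the intercross progeny are expected to be homozygous for the deletion, which are the animals that lack any Jκ-hybridizing signal.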



EXAMPLE 10 

Inactivation of the Mouse Heavy Chain Gene by Homologous
Recombination

This example describes the inactivation of the
endogenous murine immunoglobulin heavy chain locus by
homologous recombination in embryonic stem (ES) cells. The
strategy is to delete the endogenous heavy chain J segments by
homologous recombination with a vector containing heavy chain
sequences from which the JH region has been deleted and
replaced by the gene for the selectable marker neo.

Construction of a heavy chain targeting vector

Mouse heavy chain sequences containing the JH region
(Fig. 21a) were isolated from a genomic phage library derived
from the D3 ES cell line (Gossler et al., Proc. Natl. Acad.
Sci. U.S.A. 83:9065-9069 (1986)) using a JH4 specific
oligonucleotide probe:

5'- ACT ATG CTA TGG ACT ACT GGG GTC AAG GAA CCT CAG TCA CCG

A 3.5 kb genomic SacI/StuI fragment, spanning the JH
region, was isolated from a positive phage clone and subcloned
into SacI/SmaI digested pUC18. The resulting plasmid was
designated pUC18 JH. The neomycin resistance gene (neo), used
for drug selection of transfected ES cells, was derived from a
repaired version of the plasmid pGEM7 (KJ1). A report in the
literature (Yenofsky et al. (1990) Proc. Natl. Acad. Sci.
(U.S.A.) 87:3435-3439) documents a point mutation in the neo
coding sequences of several commonly used expression vectors,
including the construct pMC1neo (Thomas and Capecchi (1987)
Cell 51:503-512) which served as the source of the neo gene
used in pGEM7 (KJ1). This mutation reduces the activity of
the neo gene product and was repaired by replacing a
restriction fragment encompassing the mutation with the
corresponding sequence from a wild-type neo clone. The
HindIII site in the prepared pGEM7 (KJ1) was converted to a
SalI site by addition of a synthetic adaptor, and the neo
expression cassette excised by digestion with XbaI/SalI. The
ends of the neo fragment were then blunted by treatment with
the Klenow form of DNA pol I, and the neo fragment was
subcloned into the NaeI site of pUC18 JH, generating the
plasmid pUC18 JH-neo (Fig. 21b).

Further construction of the targeting vector was
carried out in a derivative of the plasmid pGP1b. pGP1b was
digested with the restriction enzyme NotI and ligated with the
following oligonucleotide as an adaptor:

5'- GGC CGC TCG ACG ATA GCC TCG AGG CTA TAA ATC TAG AAG AAT
TCC AGC AAA GCT TTG GC -3'

The resulting plasmid, called pGMT, was used to
build the mouse immunoglobulin heavy chain targeting
construct.

The Herpes Simplex Virus (HSV) thymidine kinase (TK)
gene was included in the construct in order to allow for
enrichment of ES clones bearing homologous recombinants, as
described by Mansour et al. (Nature 336:348-352 (1988)). The
HSV TK gene was obtained from the plasmid pGEM7 (TK) by
digestion with EcoRI and HindIII. The TK DNA fragment was
subcloned between the EcoRI and HindIII sites of pGMT,
creating the plasmid pGMT-TK (Fig. 21c).

To provide an extensive region of homology to the
target sequence, a 5.9 kb genomic XbaI/XhoI fragment, situated
5' of the JH region, was derived from a positive genomic phage
clone by limit digestion of the DNA with XhoI, and partial
digestion with XbaI. As noted in Fig. 21a, this XbaI site is
not present in genomic DNA, but is rather derived from phage
sequences immediately flanking the cloned genomic heavy chain
insert in the positive phage clone. The fragment was
subcloned into XbaI/XhoI digested pGMT-TK, to generate the
plasmid pGMT-TK-JH5' (Fig. 21d).

The final step in the construction involved the
excision from pUC18 JH-neo of the 2.8 kb EcoRI fragment which
contained the neo gene and flanking genomic sequences 3' of
JH. This fragment was blunted by Klenow polymerase and
subcloned into the similarly blunted XhoI site of
pGMT-TK-JH5'. The resulting construct, JHKO1 (Fig. 21e),
contains 6.9 kb of genomic sequences flanking the JH locus,
with a 2.3 kb deletion spanning the JH region into which has
been inserted the neo gene. Fig. 21f shows the structure of
an endogenous heavy chain gene after homologous recombination
with the targeting construct.



EXAMPLE 11 

Generation and analysis of targeted ES cells

AB-1 ES cells (McMahon and Bradley, Cell
62:1073-1085 (1990)) were grown on mitotically inactive
SNL76/7 cell feeder layers essentially as described
(Robertson, E.J. (1987) Teratocarcinomas and Embryonic Stem
Cells: A Practical Approach, E.J. Robertson, ed. (Oxford: IRL
Press), pp. 71-112). As described in the previous example,
prior to electroporation of ES cells with the targeting
construct JHKO1, the pluripotency of the ES cells was
determined by generation of AB-1 derived chimeras which were
shown capable of germline transmission of the ES genome.

The heavy chain inactivation vector JHKO1 was
digested with NotI and electroporated into AB-1 cells by the
methods described (Hasty et al., Nature 350:243-246 (1991)).
Electroporated cells were plated into 100 mm dishes at a
density of 1-2 x 10^6 cells/dish. After 24 hours, G418
(200 μg/ml of active component) and FIAU (0.5 μM) were added to
the medium, and drug-resistant clones were allowed to develop
over 8-10 days. Clones were picked, trypsinized, divided into
two portions, and further expanded. Half of the cells derived
from each clone were then frozen and the other half analyzed
for homologous recombination between vector and target
sequences.

DNA analysis was carried out by Southern blot
hybridization. DNA was isolated from the clones as described
(Laird et al. (1991) Nucleic Acids Res. 19:4293), digested
with StuI and probed with the 500 bp EcoRI/StuI fragment
designated as probe A in Fig. 21f. This probe detects a StuI
fragment of 4.7 kb in the wild-type locus, whereas a 3 kb band
is diagnostic of homologous recombination of endogenous
sequences with the targeting vector (see Fig. 21a and f). Of
525 G418 and FIAU doubly-resistant clones screened by Southern
blot hybridization, 12 were found to contain the 3 kb fragment
diagnostic of recombination with the targeting vector. That
these clones represent the expected targeted events at the JH
locus (as shown in Fig. 21f) was confirmed by further
digestion with HindIII, SpeI and HpaI. Hybridization of probe
A (see Fig. 21f) to Southern blots of HindIII, SpeI, and HpaI
digested DNA produces bands of 2.3 kb, >10 kb, and >10 kb,
respectively, for the wild-type locus (see Fig. 21a), whereas
bands of 5.3 kb, 3.8 kb, and 1.9 kb, respectively, are
expected for the targeted heavy chain locus (see Fig. 21f).
All 12 positive clones detected by the StuI digest showed the
predicted HindIII, SpeI, and HpaI bands diagnostic of a
targeted JH gene. In addition, Southern blot analysis of a
StuI digest of all 12 clones using a neo-specific probe (probe
B, Fig. 21f) generated only the predicted fragment of 3 kb,
demonstrating that the clones each contained only a single
copy of the targeting vector.
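The screening counts reported for the two targeting experiments (7 of 901 doubly-resistant clones for the kappa vector in Example 9, and 12 of 525 for the heavy chain vector here) can be turned into targeting frequencies with a few lines of arithmetic. This is only a convenience sketch; the function name `targeting_frequency` is hypothetical, and no statistical comparison between the two loci is implied.

```python
def targeting_frequency(targeted: int, screened: int) -> float:
    """Fraction of doubly-resistant clones that carry a targeted allele."""
    return targeted / screened


kappa = targeting_frequency(7, 901)    # Example 9, J/C K1 vector
heavy = targeting_frequency(12, 525)   # this example, heavy chain vector

for name, freq in (("kappa", kappa), ("heavy chain", heavy)):
    # Report both a percentage and the "1 in N" form used at the bench.
    print(f"{name}: {freq:.2%} (about 1 in {round(1 / freq)})")
```

These frequencies describe clones that survived G418 and FIAU selection, so they reflect the yield after positive-negative enrichment rather than the raw rate of homologous recombination.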
Generation of mice carrying the JH deletion

Three of the targeted ES clones described in the
previous section were thawed and injected into C57BL/6J
blastocysts as described (Bradley, A. (1987) in
Teratocarcinomas and Embryonic Stem Cells: A Practical
Approach, E.J. Robertson, ed. (Oxford: IRL Press), p. 113-151)
and transferred into the uteri of pseudopregnant females. The
extent of ES cell contribution to the chimera was visually
estimated from the amount of agouti coat coloration, derived
from the ES cell line, on the black C57BL/6J background. Half
of the offspring resulting from blastocyst injection of two of
the targeted clones were chimeric (i.e., showed agouti as well
as black pigmentation); the third targeted clone did not
generate any chimeric animals. The majority of the chimeras
showed significant (approximately 50 percent or greater) ES
cell contribution to coat pigmentation. Since the AB-1 ES
cells are an XY cell line, most of the chimeras were male, due
to sex conversion of female embryos colonized by male ES
cells. Male chimeras were bred with C57BL/6J females and the
offspring monitored for the presence of the dominant agouti
coat color indicative of germline transmission of the ES
genome. Chimeras from both of the clones consistently
generated agouti offspring. Since only one copy of the heavy
chain locus was targeted in the injected ES clones, each
agouti pup had a 50 percent chance of inheriting the mutated
locus. Screening for the targeted gene was carried out by
Southern blot analysis of StuI-digested DNA from tail
biopsies, using the probe utilized in identifying targeted ES
clones (probe A, Fig. 21f). As expected, approximately 50
percent of the agouti offspring showed a hybridizing StuI band
of approximately 3 kb in addition to the wild-type band of 4.7
kb, demonstrating germline transmission of the targeted JH
gene segment.

In order to generate mice homozygous for the
mutation, heterozygotes were bred together and the heavy chain
genotype of the offspring determined as described above. As
expected, three genotypes were derived from the heterozygote
matings: wild-type mice bearing two copies of the normal JH
locus, heterozygotes carrying one targeted copy of the gene and
one normal copy, and mice homozygous for the JH mutation. The
absence of JH sequences from these latter mice was verified by
hybridization of the Southern blots of StuI-digested DNA with
a probe specific for JH (probe C, Fig. 21a). Whereas
hybridization of the JH probe to a 4.7 kb fragment in DNA
samples from heterozygous and wild-type siblings was observed,
no signal was present in samples from the JH-mutant
homozygotes, attesting to the generation of a novel mouse
strain in which both copies of the heavy chain gene have been
mutated by deletion of the JH sequences.

EXAMPLE 12 
Heavy Chain Minilocus Transgene
A. Construction of plasmid vectors for cloning large DNA
sequences
1. pGP1a

The plasmid pBR322 was digested with EcoRI and StyI
and ligated with the following oligonucleotides:

oligo-42 5'- caa gag ccc gcc taa tga gcg ggc ttt ttt ttg cat
act gcg gcc gct -3'

oligo-43 5'- aat tag cgg ccg cag tat gca aaa aaa agc ccg ctc
att agg cgg gct -3'

The resulting plasmid, pGP1a, is designed for
cloning very large DNA constructs that can be excised by the
rare cutting restriction enzyme NotI. It contains a NotI
restriction site downstream (relative to the ampicillin
resistance gene, AmpR) of a strong transcription termination
signal derived from the trpA gene (Christie et al., Proc.
Natl. Acad. Sci. USA 78:4180 (1981)). This termination signal
reduces the potential toxicity of coding sequences inserted
into the NotI site by eliminating readthrough transcription
from the AmpR gene. In addition, this plasmid is low copy
relative to the pUC plasmids because it retains the pBR322
copy number control region. The low copy number further
reduces the potential toxicity of insert sequences and reduces
the selection against large inserts due to DNA replication.
The vectors pGP1b, pGP1c, pGP1d, and pGP1f are derived from
pGP1a and contain different polylinker cloning sites. The
polylinker sequences are given below:



pGP1a
NotI
GCGGCCGC

pGP1b
NotI XhoI ClaI BamHI HindIII NotI
GCggccgcctcgagatcactatcgattaattaaggatccagcagtaagcttgcGGCCGC

pGP1c
NotI SmaI XhoI SalI HindIII BamHI SacII NotI
GCggccgcatcccgggtctcgaggtcgacaagctttcgaggatccgcGGCCGC

pGP1d
NotI SalI HindIII ClaI BamHI XhoI NotI
GCggccgctgtcgacaagcttatcgatggatcctcgagtgcGGCCGC

pGP1f
NotI SalI HindIII EcoRI ClaI KpnI BamHI XhoI NotI
GCggccgctgtcgacaagcttcgaattcagatcgatgtggtacctggatcctcgagtgcGGCCGC

Each of these plasmids can be used for the construction of
large transgene inserts that are excisable with NotI so that
the transgene DNA can be purified away from vector sequences
prior to microinjection.
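The site order listed for each polylinker can be checked directly against its sequence. The sketch below does this for the pGP1b polylinker as transcribed from this text; the recognition sequences are the standard ones for these enzymes, while the function name `map_sites` and the choice to report 0-based start positions are illustrative assumptions.

```python
import re

# Standard recognition sequences for the enzymes named in the pGP1b polylinker.
SITES = {
    "NotI": "GCGGCCGC",
    "XhoI": "CTCGAG",
    "ClaI": "ATCGAT",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}

# pGP1b polylinker as transcribed from the text.
PGP1B = "GCggccgcctcgagatcactatcgattaattaaggatccagcagtaagcttgcGGCCGC"


def map_sites(seq: str, sites: dict[str, str] = SITES) -> list[tuple[int, str]]:
    """Return (0-based start, enzyme) for every site found, in 5' to 3' order."""
    seq = seq.upper()
    hits = []
    for enzyme, motif in sites.items():
        # A lookahead is used so that overlapping occurrences are also reported.
        for match in re.finditer(f"(?={motif})", seq):
            hits.append((match.start(), enzyme))
    return sorted(hits)


for position, enzyme in map_sites(PGP1B):
    print(position, enzyme)
```

The order recovered from the sequence (NotI, XhoI, ClaI, BamHI, HindIII, NotI) agrees with the site labels given for pGP1b, with the two NotI sites bracketing the polylinker.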

2. pGP1b

pGP1a was digested with NotI and ligated with the
following oligonucleotides:

oligo-47 5'- ggc cgc aag ctt act gct gga tcc tta att aat cga
tag tga tct cga ggc -3'

oligo-48 5'- ggc cgc ctc gag atc act atc gat taa tta agg atc
cag cag taa gct tgc -3'

The resulting plasmid, pGP1b, contains a short
polylinker region flanked by NotI sites. This facilitates the
construction of large inserts that can be excised by NotI
digestion.

3. pGPe

The following oligonucleotides:

oligo-44 5'- ctc cag gat cca gat atc agt acc tga aac agg gct
tgc -3'

oligo-45 5'- ctc gag cat gca cag gac ctg gag cac aca cag cct
tcc -3'

were used to amplify the immunoglobulin heavy chain 3'
enhancer (S. Pettersson, et al., Nature 344:165-168 (1990))
from rat liver DNA by the polymerase chain reaction technique.

The amplified product was digested with BamHI and
SphI and cloned into BamHI/SphI digested pNN03 (pNN03 is a pUC
derived plasmid that contains a polylinker with the following
restriction sites, listed in order: NotI, BamHI, NcoI, ClaI,
EcoRV, XbaI, SacI, XhoI, SphI, PstI, BglII, EcoRI, SmaI, KpnI,
HindIII, and NotI). The resulting plasmid, pRE3, was digested
with BamHI and HindIII, and the insert containing the rat Ig
heavy chain 3' enhancer cloned into BamHI/HindIII digested
pGP1b. The resulting plasmid, pGPe (Fig. 22 and Table 1),
contains several unique restriction sites into which sequences
can be cloned and subsequently excised together with the 3'
enhancer by NotI digestion.

TABLE 1. Sequence of vector pGPe (sequence listing not reproduced).

B. Construction of IgM expressing minilocus transgene, pIGM1

1. Isolation of J-μ constant region clones and construction
of pJM1

A human placental genomic DNA library cloned into
the phage vector λEMBL3/SP6/T7 (Clonetech Laboratories, Inc.,
Palo Alto, CA) was screened with the human heavy chain J
region specific oligonucleotide:

oligo-1 5'- gga ctg tgt ccc tgt gtg atg ctt ttg atg tct ggg
gcc aag -3'

and the phage clone λ1.3 isolated. A 6 kb HindIII/KpnI
fragment from this clone, containing all six J segments as
well as D segment DHQ52 and the heavy chain J-μ intronic
enhancer, was isolated. The same library was screened with
the human μ specific oligonucleotide:

oligo-2 5'- cac caa gtt gac ctg cct ggt cac aga cct gac cac
cta tga -3'

and the phage clone λ2.1 isolated. A 10.5 kb HindIII/XhoI
fragment, containing the μ switch region and all of the μ
constant region exons, was isolated from this clone. These
two fragments were ligated together with KpnI/XhoI digested
pNN03 to obtain the plasmid pJM1.

2. pJM2

A 4 kb XhoI fragment was isolated from phage clone
λ2.1 that contains sequences immediately downstream of the
sequences in pJM1, including the so called Σμ element involved
in δ-associated deletion of the μ gene in certain IgD expressing
B-cells (Yasui et al., Eur. J. Immunol. 19:1399 (1989), which
is incorporated herein by reference). This fragment was
treated with the Klenow fragment of DNA polymerase I and
ligated to XhoI cut, Klenow treated, pJM1. The resulting
plasmid, pJM2 (Fig. 23), had lost the internal XhoI site but
retained the 3' XhoI site due to incomplete reaction by the
Klenow enzyme. pJM2 contains the entire human J region, the
heavy chain J-μ intronic enhancer, the μ switch region and all
of the μ constant region exons, as well as the two 0.4 kb
direct repeats, σμ and Σμ, involved in δ-associated deletion
of the μ gene.

3. Isolation of D region clones and construction of pDH1

The following human D region specific
oligonucleotide:

oligo-4 5'- tgg tat tac tat ggt tcg ggg agt tat tat aac cac
agt gtc -3'

was used to screen the human placenta genomic library for D
region clones. Phage clones λ4.1 and λ4.3 were isolated. A
5.5 kb XhoI fragment, that includes the D elements DK1, DN1,
and DM2 (Ichihara et al., EMBO J. 7:4141 (1988)), was isolated
from phage clone λ4.1. An adjacent upstream 5.2 kb XhoI
fragment, that includes the D elements DLR1, DXP1, DXP'1, and
DA1, was isolated from phage clone λ4.3. Each of these D
region XhoI fragments was cloned into the SalI site of the
plasmid vector pSP72 (Promega, Madison, WI) so as to destroy
the XhoI site linking the two sequences. The upstream
fragment was then excised with XhoI and SmaI, and the
downstream fragment with EcoRV and XhoI. The resulting
isolated fragments were ligated together with SalI digested
pSP72 to give the plasmid pDH1. pDH1 contains a 10.6 kb
insert that includes at least 7 D segments and can be excised
with XhoI (5') and EcoRV (3').

4. pCOR1

The plasmid pJM2 was digested with Asp718 (an
isoschizomer of KpnI) and the overhang filled in with the
Klenow fragment of DNA polymerase I. The resulting DNA was
then digested with ClaI and the insert isolated. This insert
was ligated to the XhoI/EcoRV insert of pDH1 and XhoI/ClaI
digested pGPe to generate pCOR1 (Fig. 24).

5. pVH251

A 10.3 kb genomic HindIII fragment containing the
two human heavy chain variable region segments VH251 and VH105
(Humphries et al., Nature 331:446 (1988), which is
incorporated herein by reference) was subcloned into pSP72 to
give the plasmid pVH251.

6. pIGM1

The plasmid pCOR1 was partially digested with XhoI
and the isolated XhoI/SalI insert of pVH251 cloned into the
upstream XhoI site to generate the plasmid pIGM1 (Fig. 25).
pIGM1 contains 2 functional human variable region segments, at
least 8 human D segments, all 6 human JH segments, the human
J-μ enhancer, the human σμ element, the human μ switch region,
all of the human μ coding exons, and the human Σμ element,
together with the rat heavy chain 3' enhancer, such that all
of these sequence elements can be isolated on a single
fragment, away from vector sequences, by digestion with NotI
and microinjected into mouse embryo pronuclei to generate
transgenic animals.
C. Construction of IgM and IgG expressing minilocus
transgene, pHC1

1. Isolation of γ constant region clones

The following oligonucleotide, specific for human Ig
γ constant region genes:

oligo-29 5'- cag cag gtg cac acc caa tgc cca tga gcc cag aca
ctg gac -3'

was used to screen the human genomic library. Phage clones
λ29.4 and λ29.5 were isolated. A 4 kb HindIII fragment of
phage clone λ29.4, containing a γ switch region, was used to
probe a human placenta genomic DNA library cloned into the
phage vector lambda FIX™ II (Stratagene, La Jolla, CA). Phage
clone λSγ1.13 was isolated. To determine the subclass of the
different γ clones, dideoxy sequencing reactions were carried
out using subclones of each of the three phage clones as
templates and the following oligonucleotide as a primer:

oligo-67 5'- tga gcc cag aca ctg gac -3'

Phage clones λ29.5 and λSγ1.13 were both determined
to be of the γ1 subclass.



2. pγe1

A 7.8 kb HindIII fragment of phage clone λ29.5,
containing the γ1 coding region, was cloned into pUC18. The
resulting plasmid, pLT1, was digested with XhoI, Klenow
treated, and religated to destroy the internal XhoI site. The
resulting clone, pLT1xk, was digested with HindIII and the
insert isolated and cloned into pSP72 to generate the plasmid
clone pLT1xks. Digestion of pLT1xks at a polylinker XhoI site
and a human sequence derived BamHI site generates a 7.6 kb
fragment containing the γ1 constant region coding exons. This
7.6 kb XhoI/BamHI fragment was cloned together with an
adjacent downstream 4.5 kb BamHI fragment from phage clone
λ29.5 into XhoI/BamHI digested pGPe to generate the plasmid
clone pγe1. pγe1 contains all of the γ1 constant region
coding exons, together with 5 kb of downstream sequences,
linked to the rat heavy chain 3' enhancer.



3. pγe2

A 5.3 kb HindIII fragment containing the γ1 switch
region and the first exon of the pre-switch sterile transcript
(P. Sideras et al. (1989) International Immunol. 1:631) was
isolated from phage clone λSγ1.13 and cloned into pSP72 with
the polylinker XhoI site adjacent to the 5' end of the insert,
to generate the plasmid clone pSγ1s. The XhoI/SalI insert of
pSγ1s was cloned into XhoI digested pγe1 to generate the
plasmid clone pγe2 (Fig. 26). pγe2 contains all of the γ1
constant region coding exons, and the upstream switch region
and sterile transcript exons, together with 5 kb of downstream
sequences, linked to the rat heavy chain 3' enhancer. This
clone contains a unique XhoI site at the 5' end of the insert.
The entire insert, together with the XhoI site and the 3' rat
enhancer, can be excised from vector sequences by digestion
with NotI.




4. pHC1

The plasmid pIGM1 was digested with XhoI and the 43
kb insert isolated and cloned into XhoI digested pγe2 to
generate the plasmid pHC1 (Fig. 25). pHC1 contains 2
functional human variable region segments, at least 8 human D
segments, all 6 human JH segments, the human J-μ enhancer, the
human σμ element, the human μ switch region, all of the human
μ coding exons, the human Σμ element, and the human γ1
constant region, including the associated switch region and
sterile transcript associated exons, together with the rat
heavy chain 3' enhancer, such that all of these sequence
elements can be isolated on a single fragment, away from
vector sequences, by digestion with NotI and microinjected
into mouse embryo pronuclei to generate transgenic animals.

D. Construction of IgM and IgG expressing minilocus
transgene, pHC2

1. Isolation of human heavy chain V region gene VH49.8

The human placental genomic DNA library (lambda FIX™
II, Stratagene, La Jolla, CA) was screened with the following
human VH1 family specific oligonucleotide:

oligo-49 5'- gtt aaa gag gat ttt att cac ccc tgt gtc ctc tcc
aca ggt gtc -3'

Phage clone λ49.8 was isolated and a 6.1 kb XbaI
fragment containing the variable segment VH49.8 subcloned into
pNN03 (such that the polylinker ClaI site is downstream of
VH49.8 and the polylinker XhoI site is upstream) to generate
the plasmid pVH49.8. An 800 bp region of this insert was
sequenced, and VH49.8 found to have an open reading frame and
intact splicing and recombination signals, thus indicating
that the gene is functional (Table 2).

TABLE 2

Sequence of human VH1 family gene VH49.8
[812 bp nucleotide sequence with the predicted amino acid translation]

2. pV2

A 4 kb XbaI genomic fragment containing the human
VH IV family gene VH4-21 (Sanz et al., EMBO J., 8:3741 (1989)),
subcloned into the plasmid pUC12, was excised with SmaI and
HindIII, and treated with the Klenow fragment of polymerase I.
The blunt ended fragment was then cloned into ClaI digested,
Klenow treated, pVH49.8. The resulting plasmid, pV2, contains
the human heavy chain gene VH49.8 linked upstream of VH4-21 in
the same orientation, with a unique SalI site at the 3' end of
the insert and a unique XhoI site at the 5' end.

3. pSγ1-5'

A 0.7 kb XbaI/HindIII fragment (representing
sequences immediately upstream of, and adjacent to, the 5.3 kb
γ1 switch region containing fragment in the plasmid pγe2)
together with the neighboring upstream 3.1 kb XbaI fragment
were isolated from the phage clone λSγ1.13 and cloned into
HindIII/XbaI digested pUC18 vector. The resulting plasmid,
pSγ1-5', contains a 3.8 kb insert representing sequences
upstream of the initiation site of the sterile transcript
found in B-cells prior to switching to the γ1 isotype (P.
Sideras et al., International Immunol., 1:631 (1989)). Because
the transcript is implicated in the initiation of isotype
switching, and upstream cis-acting sequences are often
important for transcription regulation, these sequences are
included in transgene constructs to promote correct expression
of the sterile transcript and the associated switch
recombination.

4. pVGE1

The pSγ1-5' insert was excised with SmaI and
HindIII, treated with Klenow enzyme, and ligated with the
following oligonucleotide linker:

5'- ccg gtc gac cgg -3'

The ligation product was digested with SalI and ligated to
SalI digested pV2. The resulting plasmid, pVP, contains 3.8
kb of γ1 switch 5' flanking sequences linked downstream of the
two human variable gene segments VH49.8 and VH4-21 (see Table
2). The pVP insert is isolated by partial digestion with SalI
and complete digestion with XhoI, followed by purification of
the 15 kb fragment on an agarose gel. The insert is then
cloned into the XhoI site of pγe2 to generate the plasmid
clone pVGE1 (Fig. 27). pVGE1 contains two human heavy chain
variable gene segments upstream of the human γ1 constant gene
and associated switch region. A unique SalI site between the
variable and constant regions can be used to clone in D, J,
and μ gene segments. The rat heavy chain 3' enhancer is
linked to the 3' end of the γ1 gene and the entire insert is
flanked by NotI sites.

5. pHC2

The plasmid clone pVGE1 is digested with SalI and
the XhoI insert of pIGM1 is cloned into it. The resulting
clone, pHC2 (Fig. 25), contains 4 functional human variable
region segments, at least 8 human D segments, all 6 human JH
segments, the human J-μ enhancer, the human σμ element, the
human μ switch region, all of the human μ coding exons, the
human Σμ element, and the human γ1 constant region, including
the associated switch region and sterile transcript associated
exons, together with 4 kb flanking sequences upstream of the
sterile transcript initiation site. These human sequences are
linked to the rat heavy chain 3' enhancer, such that all of
the sequence elements can be isolated on a single fragment,
away from vector sequences, by digestion with NotI and
microinjected into mouse embryo pronuclei to generate
transgenic animals. A unique XhoI site at the 5' end of the
insert can be used to clone in additional human variable gene
segments to further expand the recombinational diversity of
this heavy chain minilocus.
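
The effect of adding variable gene segments on recombinational diversity can be illustrated with a simple count of germline V-D-J combinations. The sketch below is only an illustration: it takes the V segment counts given for pHC1 and pHC2, assumes the figure of 10 D segments quoted in the cDNA analysis of this Example together with all 6 J segments, and ignores junctional diversity (N region addition and imprecise joining), which increases the real number considerably.

```python
from math import prod

# Germline segment counts taken from the text. The D count of 10 is the
# figure quoted in the cDNA analysis; treating it as exact is an assumption.
MINILOCI = {
    "pHC1": {"V": 2, "D": 10, "J": 6},
    "pHC2": {"V": 4, "D": 10, "J": 6},
}


def vdj_combinations(segments: dict) -> int:
    """Number of distinct germline V-D-J combinations (no junctional diversity)."""
    return prod(segments.values())


for name, segments in MINILOCI.items():
    print(name, vdj_combinations(segments))
```

Under these assumptions, doubling the number of V segments doubles the number of germline combinations, which is the sense in which the unique XhoI site allows the minilocus to be expanded.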

E. Transgenic mice

The NotI inserts of plasmids pIGM1 and pHC1 were
isolated from vector sequences by agarose gel electrophoresis.
The purified inserts were microinjected into the pronuclei of
fertilized (C57BL/6 x CBA) F2 mouse embryos, and the
surviving embryos were transferred into pseudopregnant females
as described by Hogan et al. (B. Hogan, F. Costantini, and E.
Lacy, Methods of Manipulating the Mouse Embryo, 1986, Cold
Spring Harbor Laboratory, New York). Mice that developed from
injected embryos were analyzed for the presence of transgene
sequences by Southern blot analysis of tail DNA. Transgene copy
number was estimated by band intensity relative to control
standards containing known quantities of cloned DNA. At 3 to 8
weeks of age, serum was isolated from these animals and assayed
for the presence of transgene encoded human IgM and IgG1 by
ELISA as described by Harlow and Lane (E. Harlow and D. Lane,
Antibodies: A Laboratory Manual, 1988, Cold Spring Harbor
Laboratory, New York). Microtiter plate wells were coated
with mouse monoclonal antibodies specific for human IgM (clone
AF6, #0285, AMAC, Inc., Westbrook, ME) and human IgG1 (clone
JL512, #0280, AMAC, Inc., Westbrook, ME). Serum samples were
serially diluted into the wells and the presence of specific
immunoglobulins detected with affinity isolated alkaline
phosphatase conjugated goat anti-human Ig (polyvalent) that
had been pre-adsorbed to minimize cross-reactivity with mouse
immunoglobulins. Table 3 and Fig. 28 show the results of an
ELISA assay for the presence of human IgM and IgG1 in the
serum of two animals that developed from embryos injected with
the transgene insert of plasmid pHC1. All of the control
non-transgenic mice tested negative for expression of human IgM
and IgG1 by this assay. Mice from two lines containing the
pIGM1 NotI insert (lines #6 and 15) express human IgM but not
human IgG1. We tested mice from 6 lines that contain the pHC1
insert and found that 4 of the lines (lines #26, 38, 57 and
122) express both human IgM and human IgG1, while mice from
two of the lines (lines #19 and 21) do not express detectable
levels of human immunoglobulins. The pHC1 transgenic mice
that did not express human immunoglobulins were so-called G0
mice that developed directly from microinjected embryos and
may have been mosaic for the presence of the transgene.
Southern blot analysis indicates that many of these mice
contain one or fewer copies of the transgene per cell. The
detection of human IgM in the serum of pIGM1 transgenics, and
human IgM and IgG1 in pHC1 transgenics, provides evidence that
the transgene sequences function correctly in directing VDJ
joining, transcription, and isotype switching. Of the two
animals shown in Fig. 28, one (#18) was negative for the
transgene by Southern blot analysis, and showed no detectable
levels of human IgM or IgG1. The second animal (#38) contained
approximately 5 copies of the transgene, as assayed by Southern
blotting, and showed detectable levels of both human IgM and
IgG1. The results of ELISA assays for 11 animals that developed
from transgene injected embryos are summarized in the table
below (Table 3).

TABLE 3

Detection of human IgM and IgG1 in the serum of transgenic
animals by ELISA assay

            injected     approximate transgene
animal #    transgene    copies per cell          human IgM    human IgG1

 6          pIGM1         1                       ++           -
 7          pIGM1         0                       -            -
 9          pIGM1         0                       -            -
10          pIGM1         0                       -            -
12          pIGM1         0                       -            -
15          pIGM1        10                       ++           -
18          pHC1          0                       -            -
19          pHC1          1                       -            -
21          pHC1         <1                       -            -
26          pHC1          2                       ++           +
38          pHC1          5                       ++           +

Table 3 shows a correlation between the presence of
integrated transgene DNA and the presence of transgene encoded
immunoglobulins in the serum. Two of the animals that were
found to contain the pHC1 transgene did not express detectable
levels of human immunoglobulins. These were both low copy
animals and may not have contained complete copies of the
transgenes, or the animals may have been genetic mosaics
(indicated by the <1 copy per cell estimated for animal #21),
and the transgene containing cells may not have populated the
hematopoietic lineage. Alternatively, the transgenes may have
integrated into genomic locations that are not conducive to
their expression. The detection of human IgM in the serum of
pIGM1 transgenics, and human IgM and IgG1 in pHC1 transgenics,
indicates that the transgene sequences function correctly in
directing VDJ joining, transcription, and isotype switching.
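
The copy number estimates in Table 3 were obtained by comparing Southern blot band intensity with standards of known DNA content. The text does not say how the comparison was quantified. One simple way to make it, shown here only as a hedged sketch, is to fit a straight line through the origin to the standards and read the sample off that line. The intensity values below are hypothetical and the function names are illustrative.

```python
def fit_intensity_per_copy(standards):
    """Least-squares slope k of a line through the origin: intensity = k * copies."""
    numerator = sum(copies * intensity for copies, intensity in standards)
    denominator = sum(copies * copies for copies, _ in standards)
    return numerator / denominator


def estimate_copy_number(sample_intensity, standards):
    """Convert a band intensity into an approximate transgene copy number per cell."""
    return sample_intensity / fit_intensity_per_copy(standards)


# Hypothetical densitometry readings (arbitrary units) for standards that
# contain the equivalent of 1, 2, 5 and 10 transgene copies per cell.
STANDARDS = [(1, 100.0), (2, 200.0), (5, 500.0), (10, 1000.0)]

print(estimate_copy_number(500.0, STANDARDS))  # a band as intense as the 5-copy standard
print(estimate_copy_number(50.0, STANDARDS))   # a band fainter than the 1-copy standard
```

An estimate below one copy per cell, as for animal #21, is what would be expected if only a fraction of the cells in a mosaic founder carry the transgene.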

F. cDNA clones

To assess the functionality of the pHC1 transgene in
VDJ joining and class switching, as well as the participation of
the transgene encoded human B-cell receptor in B-cell
development and allelic exclusion, the structure of
immunoglobulin cDNA clones derived from transgenic mouse
spleen mRNA was examined. The overall diversity of the
transgene encoded heavy chains, focusing on D and J segment
usage, N region addition, CDR3 length distribution, and the
frequency of joints resulting in functional mRNA molecules, was
examined. Transcripts encoding IgM and IgG incorporating
VH105 and VH251 were examined.

Polyadenylated RNA was isolated from an eleven week
old male second generation line-57 pHC1 transgenic mouse.
This RNA was used to synthesize oligo-dT primed single
stranded cDNA. The resulting cDNA was then used as template
for four individual PCR amplifications using the following
four synthetic oligonucleotides as primers: VH251 specific
o-149, cta gct cga gtc caa gga gtc tgt gcc gag gtg cag ctg
(g,a,t,c); VH105 specific o-150, gtt gct cga gtg aaa ggt gtc
cag tgt gag gtg cag ctg (g,a,t,c); human gamma1 specific
o-151, ggc gct cga gtt cca cga cac cgt cac cgg ttc; and
human mu specific o-152, cct gct cga ggc agc caa cgg cca
cgc tgc tcg. Reaction 1 used primers o-149 and o-151 to
amplify VH251-gamma1 transcripts, reaction 2 used o-149 and
o-152 to amplify VH251-mu transcripts, reaction 3 used o-150 and
o-151 to amplify VH105-gamma1 transcripts, and reaction 4 used
o-150 and o-152 to amplify VH105-mu transcripts. The
resulting 0.5 kb PCR products were isolated from an agarose
gel; the μ transcript products were more abundant than the γ
transcript products, consistent with the corresponding ELISA
data (Fig. 34). The PCR products were digested with XhoI and
cloned into the plasmid pNN03. Double-stranded plasmid DNA
was isolated from minipreps of nine clones from each of the
four PCR amplifications and dideoxy sequencing reactions were
performed. Two of the clones turned out to be deletions
containing no D or J segments. These could not have been
derived from normal RNA splicing products and are likely to
have originated from deletions introduced during PCR
amplification. One of the DNA samples turned out to be a
mixture of two individual clones, and three additional clones
did not produce readable DNA sequence (presumably because the
DNA samples were not clean enough). The DNA sequences of the
VDJ joints from the remaining 30 clones are compiled in Table
4. Each of the sequences is unique, indicating that no
single pathway of gene rearrangement, or single clone of
transgene expressing B-cells, is dominant. The fact that no
two sequences are alike is also an indication of the large
diversity of immunoglobulins that can be expressed from a
compact minilocus containing only 2 V segments, 10 D segments,
and 6 J segments. Both of the V segments, all six of the J
segments, and 7 of the 10 D segments that are included in the
transgene are used in VDJ joints. In addition, both constant
region genes (mu and gamma1) are incorporated into
transcripts. The VH105 primer turned out not to be specific
for VH105 in the reactions performed. Therefore many of the
clones from reactions 3 and 4 contained VH251 transcripts.
Additionally, clones isolated from ligated reaction 3 PCR
product turned out to encode IgM rather than IgG; however, this
may reflect contamination with PCR product from reaction 4, as
the DNA was isolated on the same gel. An analogous
experiment, in which immunoglobulin heavy chain sequences were
amplified from adult human peripheral blood lymphocytes (PBL),
and the DNA sequence of the VDJ joints determined, was
recently reported by Yamada et al. (J. Exp. Med. 173:395-407
(1991), which is incorporated herein by reference). We
compared the data from human PBL with our data from the pHC1
transgenic mouse.
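
The design of the four PCR reactions and the clone bookkeeping described above can be written out as a short Python sketch. The primer pairings and clone numbers are those stated in the text; the function and variable names are illustrative only.

```python
from itertools import product

# Primer assignments as stated in the text.
V_PRIMERS = {"VH251": "o-149", "VH105": "o-150"}
C_PRIMERS = {"gamma1": "o-151", "mu": "o-152"}


def pcr_reactions():
    """Pair every V primer with every constant region primer (reactions 1 to 4)."""
    reactions = {}
    for number, (v, c) in enumerate(product(V_PRIMERS, C_PRIMERS), start=1):
        reactions[number] = (V_PRIMERS[v], C_PRIMERS[c], f"{v}-{c}")
    return reactions


def readable_clones(per_reaction=9, n_reactions=4, deletions=2, mixed=1, unreadable=3):
    """Clones left for analysis after discarding deletions, mixtures and failed reads."""
    return per_reaction * n_reactions - deletions - mixed - unreadable


for number, (v_primer, c_primer, target) in pcr_reactions().items():
    print(number, v_primer, c_primer, target)
print(readable_clones())
```

Nine clones from each of four reactions give 36 sequenced clones; removing the two deletions, the one mixed sample and the three unreadable samples leaves the 30 clones analyzed.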
G. J segment choice

Table 5 compares the distribution of J segments
incorporated into pHC1 transgene encoded transcripts to J
segments found in adult human PBL immunoglobulin transcripts.
The distribution profiles are very similar: J4 is the dominant
segment in both systems, followed by J6. J2 is the least
common segment in human PBL and the transgenic animal.

TABLE 5  J Segment Choice

                  Percent Usage (± 3%)
J Segment      HC1 transgenic      Human PBL

J1                   7                  1
J2                   3                 <1
J3                  17                  9
J4                  44                 53
J5                   3                 15
J6                  26                 22

                  100%               100%
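
The rounded percentages in the transgenic column of Table 5 can be related back to clone counts. The sketch below assumes that the percentages were computed over the 30 clones whose VDJ joints were sequenced; this denominator is an inference from the surrounding text, not something Table 5 states.

```python
# Percent J segment usage in the HC1 transgenic column of Table 5.
J_USAGE_PERCENT = {"J1": 7, "J2": 3, "J3": 17, "J4": 44, "J5": 3, "J6": 26}

# Assumption: the percentages refer to the 30 sequenced clones.
N_CLONES = 30


def approximate_counts(percentages: dict, n_clones: int) -> dict:
    """Convert rounded percentages into the nearest whole number of clones."""
    return {segment: round(p * n_clones / 100) for segment, p in percentages.items()}


counts = approximate_counts(J_USAGE_PERCENT, N_CLONES)
print(counts)
print(sum(counts.values()))
```

With so few clones, a single clone corresponds to about 3% usage, which is consistent with the ± 3% uncertainty given in the table heading.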



H. D segment choice

49% (40 of 82) of the clones analyzed by Yamada et
al. incorporated D segments that are included in the pHC1
transgene. An additional 11 clones contained sequences that
were not assigned by the authors to any of the known D
segments. Two of these 11 unassigned clones appear to be
derived from an inversion of the DIR2 segment, which is
included in the pHC1 construct. This mechanism, which was
predicted by Ichihara et al. (EMBO J. 7:4141 (1988)) and
observed by Sanz (J. Immunol. 147:1720-1729 (1991)), was not
considered by Yamada et al. (J. Exp. Med. 173:395-407 (1991)).
Table 6 is a comparison of the D segment distribution for the
pHC1 transgenic mouse and that observed for human PBL
transcripts by Yamada et al. The data of Yamada et al. were
recompiled to include DIR2 use, and to exclude D segments that
are not in the pHC1 transgene. Table 6 demonstrates that the
distribution of D segment incorporation is very similar in the
transgenic mouse and in human PBL. The two dominant human D
segments, DXP'1 and DN1, are also found with high frequency in
the transgenic mouse. The most dramatic dissimilarity between
the two distributions is the high frequency of DHQ52 in the
transgenic mouse as compared to the human. The high frequency
of DHQ52 is reminiscent of the D segment distribution in the
human fetal liver. Sanz has observed that 14% of the heavy
chain transcripts contained DHQ52 sequences. If D segments
not found in pHC1 are excluded from the analysis, 31% of the
fetal transcripts analyzed by Sanz contain DHQ52. This is
comparable to the 27% that we observe in the pHC1 transgenic
mouse.



TABLE 6  D Segment Choice

                  Percent Usage (± 3%)
D Segment      HC1 transgenic      Human PBL

DLR1                <1                 <1
DXP1                 3                  6
DXP'1               25                 19
DA1                 <1                 12
DK1                  7                 12
DN1                 12                 22
DIR2                 7                  4
DM2                 <1                  2
DLR2                 3                  4
DHQ52               26                  2
?                   17

                  100%               100%
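
The comparison drawn from Table 6 can be reproduced with a few lines of Python. The sketch below uses the assigned D segments only, and it replaces entries printed as "<1" with 0.5, which is an arbitrary stand-in chosen for this illustration.

```python
# Assumption: stand-in numeric value for entries printed as "<1" in Table 6.
LESS_THAN_ONE = 0.5

# Percent usage as (HC1 transgenic, human PBL) for the assigned D segments.
D_USAGE = {
    "DLR1": (LESS_THAN_ONE, LESS_THAN_ONE),
    "DXP1": (3, 6),
    "DXP'1": (25, 19),
    "DA1": (LESS_THAN_ONE, 12),
    "DK1": (7, 12),
    "DN1": (12, 22),
    "DIR2": (7, 4),
    "DM2": (LESS_THAN_ONE, 2),
    "DLR2": (3, 4),
    "DHQ52": (26, 2),
}


def largest_difference(usage: dict) -> str:
    """Return the D segment whose usage differs most between the two columns."""
    return max(usage, key=lambda seg: abs(usage[seg][0] - usage[seg][1]))


def top_segments(usage: dict, column: int, n: int = 2) -> list:
    """Return the n most frequently used segments in the chosen column."""
    return sorted(usage, key=lambda seg: usage[seg][column], reverse=True)[:n]


print(largest_difference(D_USAGE))
print(top_segments(D_USAGE, column=1))
```

The segment with the largest difference is DHQ52, and the two most used segments in human PBL are DN1 and DXP'1, in agreement with the discussion in section H.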



I. Functionality of VDJ joints

Table 7 shows the predicted amino acid sequences of
the VDJ regions from 30 clones that were analyzed from the
pHC1 transgenic. The translated sequences indicate that 23 of
the 30 VDJ joints (77%) are in-frame with respect to the
variable and J segments.
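
To put the observed fraction of in-frame joints in context, it can be compared with what purely random joining would give. The sketch below assumes that, in the absence of any selection, each joint falls in-frame independently with probability 1/3; this null model is an assumption introduced here for illustration and is not taken from the text.

```python
from math import comb


def binomial_tail(n: int, k: int, p: float) -> float:
    """Probability of observing at least k successes in n independent trials."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


N_JOINTS = 30
N_IN_FRAME = 23

# Assumption: without selection, one joint in three is in-frame.
P_RANDOM_IN_FRAME = 1 / 3

observed_percent = round(100 * N_IN_FRAME / N_JOINTS)
tail = binomial_tail(N_JOINTS, N_IN_FRAME, P_RANDOM_IN_FRAME)

print(observed_percent)  # percent of joints in-frame
print(tail)              # chance of 23 or more in-frame joints under the null model
```

Under this assumed model, 23 or more in-frame joints out of 30 would be very unlikely to arise by chance, which is what would be expected if B-cells with functional rearrangements are enriched.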



TABLE 7  Functionality of V-D-J Joints

Clone  VH      D, JH, C        FR3    CDR3 and FR4

 1     VH251   ZaQlZ J2 ti     ?OR    riTCVOATOX WGQGTMVTVSSASTK
 2     VH251   ZUl J4 t»       TOR    HRIAMCTOT VCOCTLVTVSSASTK
 3     VH251   Z? Jo y\        YCAR   vrrrrTGwv v«uu* ".tvssastx
 4     VH251   5XPU Jo M       TOR
 5     VH251   axp , i Yt      *OR    RRXYGSGSYYWFDY WGOG7LVTVSSXSTK
 6     VH251   C? J3 yi        TOR    KCVSCAIUI MOQSZMVTVSSASTK
 7     VH251   0HQ52 J3 u      TOR    AIOffDX MGQGZMVTVSSGSftS
 8     VH251   ZHQS2 J 6 u     TOW    SMWBftTTTOCV VQQCRVTVSSCSRS
 9     VH251   Jl u            TOR    ^TQfl WCOSXLVTVSSCSRS
10     VH251   CLR2 J4 u       TOR    HVXHSrmf WBQSIIOTVSSSSRS
11     VH251   CXfl J4 it      tor    QXTWRCWrox MG0S7LVTVSSSSAS
12     VH251   0? Jl it        TOR    0TTQH inOCILVTVSSGSRS
13     VH251   0H032 J6 u      tout   axrarrow tnoGRmsssas
14     VH251   oxr-i j« »      TOR
15     VH251   DXPU J4 71      taut   OGMsmanxxsuo
16     VH105   off 1 : js u    TCVR
17     VH251   own J4 7*       tew    RRTTCSCSmVFDT MBQ67SVTVSSXSTK
18     VH251   0HM2 J4 Tt      TOR    QTMQGDX NQQBXXiVTVSSRSXK
19     VH251   on j* Tt        TOR    GXS6TDCTXYGXHV MGQ6ZTVTV5SASXK
20     VH251   08032 J4 n      TOR
21     VH251   on J2 n         TOR
22     VH251   onu js 71       TOR    ASLfAJ DS XUMJV MUUU1TVTVSSX5TK
23     VH251   0XB2 J4 it      TOR
24     VH105   07 Jt it        TCW
25     VH105   0»1 J4 |t       XCVR
26     VH251   on J3 1&        TOR
27     VH105   08052 J3 it     TCW
28             on j« tt
29     VH105   on J4 it        TCBC





J. CDR3 length distribution

Table 8 compares the length of the CDR3 peptides
from transcripts with in-frame VDJ joints in the pHC1
transgenic mouse to those in human PBL. Again the human PBL
data comes from Yamada et al. The profiles are similar, with
the transgenic profile skewed slightly toward smaller CDR3
peptides than observed from human PBL. The average length of
CDR3 in the transgenic mouse is 10.3 amino acids. This is
substantially the same as the average size reported for
authentic human CDR3 peptides by Sanz (J. Immunol. 147:1720-1729
(1991)).

TABLE 8  CDR3 Length Distribution

Percent Occurrence (± 3%)

# amino acids in CDR3    HC1 transgenic    Human PBL

38 48 41 

20 9-12 11 37 

19-23 <1 ^_ 



>23 



25 



100% 100%



EXAMPLE 13
Rearranged Heavy Chain Transgenes

A. Isolation of Rearranged Human Heavy Chain VDJ segments.

Two human leukocyte genomic DNA libraries cloned
into the phage vector λEMBL3/SP6/T7 (Clonetech Laboratories,
Inc., Palo Alto, CA) are screened with a 1 kb PacI/HindIII
fragment of λ1.3 containing the human heavy chain J-μ intronic
enhancer. Positive clones are tested for hybridization with a
mixture of the following VH specific oligonucleotides:

oligo-7 5'-tca gtg aag gtt tcc tgc aag gca tct gga tac acc
ttc acc-3'

oligo-8 5'-tcc ctg aga ctc tcc tgt gca gcc tct gga ttc acc
ttc ar • •

Clones that hybridize with both V and J-μ probes
are isolated and the DNA sequence of the rearranged VDJ
segment determined.

B. Construction of rearranged human heavy chain transgenes

Fragments containing functional VDJ segments (open
reading frame and splice signals) are subcloned into the
plasmid vector pSP72 such that the plasmid derived XhoI site
is adjacent to the 5' end of the insert sequence. A subclone
containing a functional VDJ segment is digested with XhoI and
PacI (PacI, a rare-cutting enzyme, recognizes a site near the
J-μ intronic enhancer), and the insert cloned into XhoI/PacI
digested pHC2 to generate a transgene construct with a
functional VDJ segment, the J-μ intronic enhancer, the μ
switch element, the μ constant region coding exons, and the γ1
constant region, including the sterile transcript associated
sequences, the γ1 switch, and the coding exons. This
transgene construct is excised with NotI and microinjected
into the pronuclei of mouse embryos to generate transgenic
animals as described above.

EXAMPLE 14

Light Chain Transgenes
A. Construction of Plasmid vectors
1. Plasmid vector pGP1c

Plasmid vector pGP1a is digested with NotI and the
following oligonucleotides ligated in:

oligo-81 5'-ggc cgc atc ccg ggt ctc gag gtc gac aag ctt tcg
agg atc cgc-3'

oligo-82 5'-ggc cgc gga tcc tcg aaa gct tgt cga cct cga gac
ccg gga tgc-3'

The resulting plasmid, pGP1c, contains a polylinker with XmaI,
XhoI, SalI, HindIII, and BamHI restriction sites flanked by
NotI sites.
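
The order of restriction sites stated for the pGP1c polylinker can be confirmed from the sequence of oligo-81. The following Python sketch locates each recognition sequence in the oligonucleotide; the recognition sequences are the standard ones for these enzymes, and the function name is illustrative.

```python
# oligo-81 as given in the text, with spaces removed and in upper case.
OLIGO_81 = "ggc cgc atc ccg ggt ctc gag gtc gac aag ctt tcg agg atc cgc".replace(" ", "").upper()

# Standard recognition sequences of the enzymes named in the text.
SITES = {
    "XmaI": "CCCGGG",
    "XhoI": "CTCGAG",
    "SalI": "GTCGAC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
}


def site_order(sequence: str, sites: dict) -> list:
    """Return enzyme names ordered by the first position of their site in `sequence`."""
    positions = {name: sequence.find(motif) for name, motif in sites.items()}
    missing = [name for name, position in positions.items() if position < 0]
    if missing:
        raise ValueError(f"sites not found: {missing}")
    return sorted(positions, key=positions.get)


print(site_order(OLIGO_81, SITES))
```

The sites occur in the order XmaI, XhoI, SalI, HindIII, BamHI, matching the description of the polylinker.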
2. Plasmid vector pGP1d

Plasmid vector pGP1a is digested with NotI and the
following oligonucleotides ligated in:

oligo-87 5'-ggc cgc tgt cga caa gct tat cga tgg atc ctc gag
tgc -3'

oligo-88 5'-ggc cgc act cga gga tcc atc gat aag ctt gtc gac
agc -3'

The resulting plasmid, pGP1d, contains a polylinker with SalI,
HindIII, ClaI, BamHI, and XhoI restriction sites flanked by
NotI sites.
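
The two pGP1d oligonucleotides are designed to anneal to each other and leave single-stranded ends that fit a NotI-cut vector. NotI cleavage leaves a 5' GGCC overhang, so each oligonucleotide should begin with GGCC and the remainder of one strand should be the reverse complement of the other. The Python sketch below checks this for oligo-87 and oligo-88; the function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return sequence.translate(COMPLEMENT)[::-1]


def anneals_with_overhangs(top: str, bottom: str, overhang: str = "GGCC") -> bool:
    """True if the strands pair perfectly, leaving `overhang` at each 5' end."""
    n = len(overhang)
    return (
        top.startswith(overhang)
        and bottom.startswith(overhang)
        and top[n:] == reverse_complement(bottom)[:-n]
    )


# Sequences as given in the text, with spaces removed and in upper case.
OLIGO_87 = "ggc cgc tgt cga caa gct tat cga tgg atc ctc gag tgc".replace(" ", "").upper()
OLIGO_88 = "ggc cgc act cga gga tcc atc gat aag ctt gtc gac agc".replace(" ", "").upper()

print(anneals_with_overhangs(OLIGO_87, OLIGO_88))
```

The same check applied to oligo-81 and oligo-82 of pGP1c gives the same result, so both linkers can be ligated directly into NotI-digested pGP1a.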

B. Isolation of Jκ and Cκ clones

A human placental genomic DNA library cloned into
the phage vector λEMBL3/SP6/T7 (Clonetech Laboratories, Inc.,
Palo Alto, CA) was screened with the human kappa light chain J
region specific oligonucleotide:

oligo-36 5'- cac ctt cgg cca agg gac acg act gga gat taa acg
taa gca-3'

and the phage clones λ36.2 and λ36.5 isolated. A 7.4 kb XhoI
fragment that includes the Jκ1 segment was isolated from
λ36.2 and subcloned into the plasmid pNN03 to generate the
plasmid clone p36.2. A neighboring 13 kb XhoI fragment that
includes Jκ segments 2 through 5 together with the Cκ gene
segment was isolated from phage clone λ36.5 and subcloned into
the plasmid pNN03 to generate the plasmid clone p36.5.
Together these two clones span the region beginning 7.2 kb
upstream of Jκ1 and ending 9 kb downstream of Cκ.

C. Construction of rearranged light chain transgenes

1. pCK1, a Cκ vector for expressing rearranged variable
segments

The 13 kb XhoI insert of plasmid clone p36.5
containing the Cκ gene, together with 9 kb of downstream
sequences, is cloned into the SalI site of plasmid vector
pGP1c with the 5' end of the insert adjacent to the plasmid
XhoI site. The resulting clone, pCK1, can accept cloned
fragments containing rearranged VJκ segments into the unique
5' XhoI site. The transgene can then be excised with NotI and
purified from vector sequences by gel electrophoresis. The
resulting transgene construct will contain the human J-Cκ
intronic enhancer and may contain the human 3' κ enhancer.

2. pCK2, a Cκ vector with heavy chain enhancers for
expressing rearranged variable segments

A 0.9 kb XbaI fragment of mouse genomic DNA
containing the mouse heavy chain J-μ intronic enhancer (J.
Banerji et al., Cell 33:729-740 (1983)) was subcloned into
pUC18 to generate the plasmid pJH22.1. This plasmid was
linearized with SphI and the ends filled in with Klenow
enzyme. The Klenow treated DNA was then digested with HindIII
and a 1.4 kb MluI/HindIII fragment of phage clone λ1.3
(previous example), containing the human heavy chain J-μ
intronic enhancer (Hayday et al., Nature 307:334-340 (1984)),
was ligated to it. The resulting plasmid, pMHE1, consists of the
mouse and human heavy chain J-μ intronic enhancers ligated together
into pUC18 such that they are excised on a single
BamHI/HindIII fragment. This 2.3 kb fragment is isolated and
cloned into pGP1c to generate pMHE2. pMHE2 is digested with
SalI and the 13 kb XhoI insert of p36.5 cloned in. The
resulting plasmid, pCK2, is identical to pCK1, except that the
mouse and human heavy chain J-μ intronic enhancers are fused
to the 3' end of the transgene insert. To modulate expression
of the final transgene, analogous constructs can be generated
with different enhancers, e.g. the mouse or rat 3' kappa or
heavy chain enhancer (Meyer and Neuberger, EMBO J.,
8:1959-1964 (1989); Petterson et al., Nature, 344:165-168
(1990)).

3. Isolation of rearranged kappa light chain variable
segments

Two human leukocyte genomic DNA libraries cloned
into the phage vector λEMBL3/SP6/T7 (Clonetech Laboratories,
Inc., Palo Alto, CA) were screened with the human kappa light
chain J region containing 3.5 kb XhoI/SmaI fragment of p36.5.
Positive clones were tested for hybridization with the
following Vκ specific oligonucleotide:

oligo-65 5'-agg ttc agt ggc agt ggg tct ggg aca gac ttc act
ctc acc atc agc-3'

Clones that hybridized with both V and J probes are isolated
and the DNA sequence of the rearranged VJκ segment determined.

4. Generation of transgenic mice containing rearranged human
light chain constructs.

Fragments containing functional VJ segments (open
reading frame and splice signals) are subcloned into the
unique XhoI sites of vectors pCK1 and pCK2 to generate
rearranged kappa light chain transgenes. The transgene
constructs are isolated from vector sequences by digestion
with NotI. Agarose gel purified insert is microinjected into
mouse embryo pronuclei to generate transgenic animals.
Animals expressing human kappa chain are bred with heavy chain
minilocus containing transgenic animals to generate mice
expressing fully human antibodies.

Because not all VJκ combinations may be capable of
forming stable heavy-light chain complexes with a broad
spectrum of different heavy chain VDJ combinations, several
different light chain transgene constructs are generated, each
using a different rearranged VJκ clone, and transgenic mice
that result from these constructs are bred with heavy chain
minilocus transgene expressing mice. Peripheral blood,
spleen, and lymph node lymphocytes are isolated from double
transgenic (both heavy and light chain constructs) animals,
stained with fluorescent antibodies specific for human and
mouse heavy and light chain immunoglobulins (Pharmingen, San
Diego, CA) and analyzed by flow cytometry using a FACScan
analyzer (Becton Dickinson, San Jose, CA). Rearranged light
chain transgene constructs that result in the highest level
of human heavy/light chain complexes on the surface of the
highest number of B cells, and do not adversely affect the
immune cell compartment (as assayed by flow cytometric
analysis with B and T cell subset specific antibodies), are
selected for the generation of human monoclonal antibodies.

D. Construction of unrearranged light chain minilocus
transgenes

1. pJCK1, a Jκ, Cκ containing vector for constructing
minilocus transgenes

The 13 kb Cκ containing XhoI insert of p36.5 is
treated with Klenow enzyme and cloned into HindIII digested,
Klenow-treated, plasmid pGP1d. A plasmid clone is selected
such that the 5' end of the insert is adjacent to the vector
derived ClaI site. The resulting plasmid, p36.5-1d, is
digested with ClaI and Klenow-treated. The Jκ1 containing 7.4
kb XhoI insert of p36.2 is then Klenow-treated and cloned into
the ClaI, Klenow-treated p36.5-1d. A clone is selected in
which the p36.2 insert is in the same orientation as the p36.5
insert. This clone, pJCK1 (Fig. 34), contains the entire
human Jκ region and Cκ, together with 7.2 kb of upstream
sequences and 9 kb of downstream sequences. The insert also
contains the human J-Cκ intronic enhancer and may contain a
human 3' κ enhancer. The insert is flanked by a unique 3'
SalI site for the purpose of cloning additional 3' flanking
sequences such as heavy chain or light chain enhancers. A
unique XhoI site is located at the 5' end of the insert for
the purpose of cloning in unrearranged Vκ gene segments. The
unique SalI and XhoI sites are in turn flanked by NotI sites
that are used to isolate the completed transgene construct
away from vector sequences.



2. Isolation of unrearranged Vκ gene segments and generation
of transgenic animals expressing human Ig light chain protein

The Vκ specific oligonucleotide, oligo-65 (discussed
above), is used to probe a human placental genomic DNA library
cloned into the phage vector λEMBL3/SP6/T7 (Clontech
Laboratories, Inc., Palo Alto, CA). Variable gene segments
from the resulting clones are sequenced, and clones that
appear functional are selected. Criteria for judging
functionality include: open reading frames, intact splice
acceptor and donor sequences, and intact recombination
sequence. DNA fragments containing selected variable gene
segments are cloned into the unique XhoI site of plasmid pJCK1
to generate minilocus constructs. The resulting clones are
digested with NotI and the inserts isolated and injected into
mouse embryo pronuclei to generate transgenic animals. The
transgenes of these animals will undergo V to J joining in
developing B-cells. Animals expressing human kappa chain are
bred with heavy chain minilocus containing transgenic animals
to generate mice expressing fully human antibodies.
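
The functionality criteria listed above (open reading frame, intact splice sites, intact recombination sequence) lend themselves to a simple sequence screen. The following is a minimal sketch in Python, assuming that the leader exon, intron, V exon and 3' flanking sequence of a candidate segment have already been delimited by hand. The function names are hypothetical, and the consensus heptamer, 12 bp spacer and nonamer are matched exactly, although functional recombination signals are known to tolerate some divergence from consensus.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}

# Consensus recombination signal 3' of a V-kappa segment:
# heptamer, 12 bp spacer, nonamer (exact match is an assumption of this sketch).
RSS_12 = re.compile(r"CACAGTG[ACGT]{12}ACAAAAACC")


def has_open_reading_frame(coding):
    """True if the spliced coding sequence has no in-frame stop codon."""
    coding = coding.upper()
    usable = len(coding) - len(coding) % 3
    return all(coding[i:i + 3] not in STOP_CODONS for i in range(0, usable, 3))


def has_canonical_splice_sites(intron):
    """True if the intron starts with a GT donor and ends with an AG acceptor."""
    intron = intron.upper()
    return len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")


def has_consensus_rss(flank_3prime):
    """True if the 3' flank contains the consensus heptamer-12-nonamer signal."""
    return RSS_12.search(flank_3prime.upper()) is not None


def appears_functional(leader_exon, intron, v_exon, flank_3prime):
    """Apply the three criteria to one hand-annotated candidate segment."""
    return (has_open_reading_frame(leader_exon + v_exon)
            and has_canonical_splice_sites(intron)
            and has_consensus_rss(flank_3prime))
```

A candidate that fails any one test would be set aside for closer inspection rather than rejected outright, since the screen is deliberately stricter than the biology.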

EXAMPLE 15 

Genomic Heavy Chain Human Ig Transgene

This Example describes the cloning of a human 
genomic heavy chain immunoglobulin transgene which is then 
introduced into the murine germline via microinjection into 
zygotes or integration in ES cells. 

Nuclei are isolated from fresh human placental
tissue as described by Marzluff, W.F., et al. (1985),
Transcription and Translation: A Practical Approach, B.D.
Hames and S.J. Higgins, eds., pp. 89-129, IRL Press, Oxford.
The isolated nuclei (or PBS washed human spermatocytes) are
embedded in 0.5% low melting point agarose blocks and lysed
with 1 mg/ml proteinase K in 500 mM EDTA, 1% SDS for nuclei, or
with 1 mg/ml proteinase K in 500 mM EDTA, 1% SDS, 10 mM DTT for
spermatocytes at 50°C for 18 hours. The proteinase K is
inactivated by incubating the blocks in 40 μg/ml PMSF in TE for
30 minutes at 50°C, and then washing extensively with TE. The
DNA is then digested in the agarose with the restriction
enzyme NotI as described by M. Finney in Current Protocols in
Molecular Biology (F. Ausubel et al., eds. John Wiley & Sons,
Supp. 4, 1988, e.g., Section 2.5.1).

The NotI digested DNA is then fractionated by pulsed
field gel electrophoresis as described by Anand et al., Nucl.
Acids Res. 17:3425-3433 (1989). Fractions enriched for the
NotI fragment are assayed by Southern hybridization to detect
one or more of the sequences encoded by this fragment. Such
sequences include the heavy chain D segments, J segments, and
γ1 constant regions together with representatives of all 6 VH
families (although this fragment is identified as a 670 kb
fragment from HeLa cells by Berman et al. (1988), supra, we
have found it to be an 830 kb fragment from human placental
and sperm DNA). Those fractions containing this NotI
fragment are ligated into the NotI cloning site of the vector
pYACNN as described (McCormick et al., Technique 2:65-71
(1990)). Plasmid pYACNN is prepared by digestion of pYACneo
(Clontech) with EcoRI and ligation in the presence of the
oligonucleotide 5'-AAT TGC GGC CGC-3'.
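
The linker oligonucleotide can be checked on paper: its last eight bases form the palindromic NotI site, so two copies pair with each other and leave 5' AATT overhangs that match EcoRI-cut ends. The sketch below, in Python, illustrates this under the assumption that a single annealed linker ligates into the EcoRI site; the helper names are hypothetical and nothing here models the rest of pYACneo.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

LINKER = "AATTGCGGCCGC"   # oligonucleotide given in the text
NOTI = "GCGGCCGC"
ECORI = "GAATTC"


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def self_annealed_overhang(oligo, overhang=4):
    """Return the 5' overhang left when two copies of the oligo pair through
    a palindromic 3' portion, or None if that portion is not palindromic."""
    core = oligo[overhang:]
    return oligo[:overhang] if core == revcomp(core) else None


def ligated_top_strand(oligo):
    """Top strand across the junction after ligation into an EcoRI site.
    EcoRI cuts G^AATTC, so the left arm ends in G and the right arm's top
    strand begins AATTC."""
    return "G" + oligo + "AATTC"


junction = ligated_top_strand(LINKER)
print(self_annealed_overhang(LINKER))        # AATT
print(NOTI in junction, ECORI in junction)   # True False
```

Under these assumptions the ligation introduces a NotI site and does not regenerate the EcoRI site on either side of the linker.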

YAC clones containing the heavy chain NotI fragment
are isolated as described by Traver et al., Proc. Natl. Acad.
Sci. USA 86:5898-5902 (1989). The cloned NotI insert is
isolated from high molecular weight yeast DNA by pulsed field
gel electrophoresis as described by M. Finney, op. cit. The
DNA is condensed by the addition of 1 mM spermine and
microinjected directly into the nucleus of single cell embryos
as previously described. Alternatively, the DNA is isolated by
pulsed field gel electrophoresis and introduced into ES cells
by lipofection (Gnirke et al., EMBO J. 10:1629-1634 (1991)),
or the YAC is introduced into ES cells by spheroplast fusion.

EXAMPLE 16 

Discontinuous Genomic Heavy Chain Ig Transgene

An 85 kb SpeI fragment of human genomic DNA,
containing VH6, D segments, J segments, the μ constant region
and part of the γ constant region, has been isolated by YAC
cloning essentially as described in Example 1. A YAC carrying
a fragment from the germline variable region, such as a 570 kb
NotI fragment upstream of the 670-830 kb NotI fragment
described above containing multiple copies of VH1 through VH5, is
isolated as described. (Berman et al. (1988), supra, detected
two 570 kb NotI fragments, each containing multiple V
segments.) The two fragments are coinjected into the nucleus
of a mouse single cell embryo as described in Example 1.

Typically, coinjection of two different DNA
fragments results in the integration of both fragments at the
same insertion site within the chromosome. Therefore,
approximately 50% of the resulting transgenic animals that
contain at least one copy of each of the two fragments will
have the V segment fragment inserted upstream of the constant
region containing fragment. Of these animals, about 50% will
carry out V to DJ joining by DNA inversion and about 50% by
deletion, depending on the orientation of the 570 kb NotI
fragment relative to the position of the 85 kb SpeI fragment.
DNA is isolated from resultant transgenic animals and those
animals found to contain both transgenes by Southern
blot hybridization (specifically, those animals containing
both multiple human V segments and human constant region
genes) are tested for their ability to express human
immunoglobulin molecules in accordance with standard
techniques.
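
The proportions stated above follow from two binary choices: whether the V segment fragment lies upstream or downstream of the constant region fragment, and whether the two fragments have the same or opposite orientation. The following Python sketch enumerates the four arrangements, assuming each choice is equally likely and independent. The assignment of same orientation to deletion and opposite orientation to inversion is the conventional one and is an assumption here, since the text states only that the outcome depends on orientation.

```python
from fractions import Fraction
from itertools import product


def joining_outcomes():
    """Fraction of double-transgenic animals expected in each class,
    assuming two independent, equally likely binary choices."""
    half = Fraction(1, 2)
    outcomes = {}
    for v_upstream, same_orientation in product([True, False], repeat=2):
        if not v_upstream:
            label = "V fragment downstream of constant region fragment"
        elif same_orientation:
            label = "V to DJ joining by deletion"
        else:
            label = "V to DJ joining by inversion"
        outcomes[label] = outcomes.get(label, Fraction(0)) + half * half
    return outcomes


for label, share in joining_outcomes().items():
    print(f"{label}: {share}")
```

On these assumptions about one quarter of the animals carrying both fragments would join by deletion and one quarter by inversion.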

EXAMPLE 17

Identification of functionally rearranged variable region
sequences in transgenic B cells

An antigen of interest is used to immunize (see
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring
Harbor, New York (1988)) a mouse with the following genetic
traits: homozygosity at the endogenous heavy chain locus for
a deletion of JH (Example 10); hemizygous for a single copy
of unrearranged human heavy chain minilocus transgene
(Examples 5 and 14); and hemizygous for a single copy of a
rearranged human kappa light chain transgene (Examples 6 and

Following the schedule of immunization, the spleen
is removed, and spleen cells used to generate hybridomas.
Cells from an individual hybridoma clone that secretes
antibodies reactive with the antigen of interest are used to
prepare genomic DNA. A sample of the genomic DNA is digested
with several different restriction enzymes that recognize
unique six base pair sequences, and fractionated on an agarose
gel. Southern blot hybridization is used to identify two DNA
fragments in the 2-10 kb range, one of which contains the
single copy of the rearranged human heavy chain VDJ sequences
and one of which contains the single copy of the rearranged
human light chain VJ sequence. These two fragments are size
fractionated on agarose gel and cloned directly into pUC18.
The cloned inserts are then subcloned respectively into heavy
and light chain expression cassettes that contain constant
region sequences.

The plasmid clone pγe1 (Example 12) is used as a
heavy chain expression cassette and rearranged VDJ sequences
are cloned into the XhoI site. The plasmid clone pCK1 is used
as a light chain expression cassette and rearranged VJ
sequences are cloned into the XhoI site. The resulting clones
are used together to transfect SP0 cells to produce antibodies
that react with the antigen of interest (Co et al., Proc.
Natl. Acad. Sci. USA 88:2869 (1991), which is incorporated
herein by reference).

Alternatively, mRNA is isolated from the cloned
hybridoma cells described above, and used to synthesize cDNA.
The expressed human heavy and light chain VDJ and VJ sequences
are then amplified by PCR and cloned (Larrick et al.,
Bio/Technology 7:934-938 (1989)). After the nucleotide sequence
of these clones has been determined, oligonucleotides are
synthesized that encode the same polypeptides, and synthetic
expression vectors generated as described by Queen et al.,
Proc. Natl. Acad. Sci. USA 84:5454-5458 (1989).

Immunization of Transgenic Animals with Complex Antigens

The following experiment demonstrates that
transgenic animals can be successfully immunized with complex
antigens such as those on human red blood cells and respond
with kinetics that are similar to the response kinetics
observed in normal mice.

Blood cells generally are suitable immunogens and
comprise many different types of antigens on the surface of
red and white blood cells.



Immunization with human blood

Tubes of human blood from a single donor were
collected and used to immunize transgenic mice having
functionally disrupted endogenous heavy chain loci (JHD) and
harboring a human heavy chain minigene construct (HC1); these
mice are designated as Line 112. Blood was washed and
resuspended in 50 ml Hanks' and diluted to 1 x 10^8 cells/ml.
0.2 ml (2 x 10^7 cells) were then injected intraperitoneally
using a 28 gauge needle and 1 cc syringe. This immunization
protocol was repeated approximately weekly for 6 weeks. Serum
titers were monitored by taking blood from retro-orbital bleeds
and collecting serum and later testing for specific antibody. A
pre-immune bleed was also taken as a control. On the very
last immunization, three days before these animals were
sacrificed for serum and for hybridomas, a single immunization
of 1 x 10^7 cells was given intravenously through the tail to
enhance the production of hybridomas.



Table 9

Animals

Mouse ID    Line    Sex    HC1-112    JHD
2343        112            +          ++
2344        112     M
2345        112     F
2346        112     F                 ++
2347        112     F                 ++
2348        112     F      +          ++
2349        112     F                 +

Mice # 2343 and 2348 have a desired phenotype: human heavy
chain mini-gene transgenic on heavy chain knock-out
background.

Generation of Hybridomas

Hybridomas were generated by fusing mouse spleen
cells of approximately 16 week-old transgenic mice (Table 9)
that had been immunized as described (supra) to a fusion
partner consisting of the non-secreting HAT-sensitive myeloma
cell line, X63 Ag8.653. Hybridoma clones were cultivated and
hybridoma supernatants containing immunoglobulins having
specific binding affinity for blood cell antigens were
identified, for example, by flow cytometry.

Flow cytometry

Serum and hybridoma supernatants were tested using
flow cytometry. Red blood cells from the donor were washed 4X
in Hanks' balanced salt solution and 50,000 cells were placed
in 1.1 ml polypropylene microtubes. Cells were incubated with
antisera or supernatant from the hybridomas for 30 minutes on
ice in staining media (1x RPMI 1640 media without phenol red
or biotin (Irvine Scientific), 3% newborn calf serum, 0.1% Na
azide). Controls consisted of littermate mice with other
genotypes. Cells were then washed by centrifugation at 4°C in
Sorvall RT600B for 5-10 minutes at 1000 rpm. Cells were
washed two times and then antibody detected on the cell
surface with a fluorescent developing reagent. Two monoclonal
reagents were used to test. One was a FITC-labeled mouse
anti-human μ heavy chain antibody (Pharmingen, San Diego, CA)
and the other was a PE-labeled rat anti-mouse kappa light
chain (Becton-Dickinson, San Jose, CA). Both of these
reagents gave similar results. Whole blood (red blood cells
and white blood cells) and white blood cells alone were used
as target cells. Both sets gave positive results.

Serum of transgenic mice and littermate controls was
incubated with either red blood cells from the donor, or white
blood cells from another individual, washed and then developed
with anti-human IgM FITC labeled antibody and analyzed in a
flow cytometer. Results showed that serum from mice that are
transgenic for the human mini-gene locus (mice 2343 and 2348)
show human IgM reactivity whereas all littermate animals
(2344, 2345, 2346, 2347) do not. Normal mouse serum (NS) and
phosphate buffer saline (PBS) were used as negative controls.
Red blood cells were ungated and white blood cells were gated
to include only lymphocytes. Lines are drawn on the x and y
axis to provide a reference. Flow cytometry was performed on
100 supernatants from fusion 2348. Four supernatants showed
positive reactivity for blood cell antigens.

EXAMPLE 18 

by antisense RNA

A. Vector for Expression of Antisense Ig Sequences

1. Construction of the cloning vector pGP1h

The vector pGP1b (referred to in a previous example)
is digested with XhoI and BamHI and ligated with the following
oligonucleotides:

5'- gat cct cga gac cag gta cca gat ctt gtg aat tcg -3'
5'- tcg acg aat tca caa gat ctg gta cct ggt ctc gag -3'

to generate the plasmid pGP1h. This plasmid contains a
polylinker that includes the following restriction sites:
NotI, EcoRI, BglII, Asp718, XhoI, BamHI, HindIII, NotI.
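
The two polylinker oligonucleotides can be checked for consistency with the stated cloning scheme. The Python sketch below assumes the sequences read as "tcg", "tca" and "ctc" where the extracted text shows "teg", "tea" and "etc"; it confirms that the strands pair everywhere except for four-base 5' overhangs and lists the recognition sites carried by the top strand. The function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Sequences as corrected from the extracted text (an assumption of this sketch).
TOP = "gatcctcgagaccaggtaccagatcttgtgaattcg".upper()
BOTTOM = "tcgacgaattcacaagatctggtacctggtctcgag".upper()

SITES = {"EcoRI": "GAATTC", "BglII": "AGATCT", "Asp718": "GGTACC", "XhoI": "CTCGAG"}


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def annealed_overhangs(top, bottom, overhang=4):
    """Return the two 5' overhangs if the strands pair everywhere else,
    otherwise None."""
    if top[overhang:] != revcomp(bottom[overhang:]):
        return None
    return top[:overhang], bottom[:overhang]


def sites_in_order(seq):
    """Names of the recognition sites found in seq, ordered 5' to 3'."""
    found = [(seq.find(site), name) for name, site in SITES.items() if site in seq]
    return [name for _, name in sorted(found)]


print(annealed_overhangs(TOP, BOTTOM))   # ('GATC', 'TCGA')
print(sites_in_order(TOP))               # ['XhoI', 'Asp718', 'BglII', 'EcoRI']
```

The GATC overhang is compatible with a BamHI end and the TCGA overhang with an XhoI end, and the site order read from the EcoRI end (EcoRI, BglII, Asp718, XhoI) agrees with the polylinker order given in the text.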

Construction of pBCE1.

A 0.8 kb XbaI/BglII fragment of pVH251 (referred to
in a previous example), that includes the promoter, leader
sequence exon, first intron, and part of the second exon of
the human VH-V family immunoglobulin variable gene segment,
was inserted into XbaI/BglII digested vector pNN03 to generate
the plasmid pVH251N.

The 2.2 kb BamHI/EcoRI DNA fragment that includes
the coding exons of the human growth hormone gene (hGH;
Seeburg, (1982) DNA 1:239-249) is cloned into BglII/EcoRI
digested pGP1h. The resulting plasmid is digested with BamHI
and the BamHI/BglII fragment of pVH251N is inserted in the same
orientation as the hGH gene to generate the plasmid pVhgh.

A 0.9 kb XbaI fragment of mouse genomic DNA
containing the mouse heavy chain J-μ intronic enhancer
(Banerji et al., (1983) Cell 33:729-740) was subcloned into
pUC18 to generate the plasmid pJH22.1. This plasmid was
linearized with SphI and the ends filled in with Klenow
enzyme. The Klenow treated DNA was then digested with HindIII
and a 1.4 kb MluI (Klenow)/HindIII fragment of phage clone λ1.3
(previous example), containing the human heavy chain J-μ
intronic enhancer (Hayday et al., (1984) Nature 307:334-340),
was ligated to it. The resulting plasmid, pMHE1, consists of
the mouse and human heavy chain J-μ intron enhancers ligated
together into pUC18 such that they can be excised on a single
BamHI/HindIII fragment.

The BamHI/HindIII fragment of pMHE1 is cloned into
BamHI/HindIII cut pVhgh to generate the B-cell expression
vector pBCE1. This vector, depicted in Fig. 36, contains
unique XhoI and Asp718 cloning sites into which antisense DNA
fragments can be cloned. The expression of these antisense
sequences is driven by the upstream heavy chain
promoter-enhancer combination; the downstream hGH gene
sequences provide polyadenylation sequences in addition to
intron sequences that promote the expression of transgene
constructs. Antisense transgene constructs generated from
pBCE1 can be separated from vector sequences by digestion with
NotI.

B. An IgM antisense transgene construct.

The following two oligonucleotides:

5'- cgc ggt acc gag agt cag tcc ttc cca aat gtc -3'
5'- cgc ctc gag aca gct gga atg ggc aca tgc aga -3'

are used as primers for the amplification of mouse IgM
constant region sequences by polymerase chain reaction (PCR)
using mouse spleen cDNA as a substrate. The resulting 0.3 kb
PCR product is digested with Asp718 and XhoI and cloned into
Asp718/XhoI digested pBCE1 to generate the antisense transgene
construct pMAS1. The purified NotI insert of pMAS1 is
microinjected into the pronuclei of half day mouse embryos,
alone or in combination with one or more other transgene
constructs, to generate transgenic mice. This construct
expresses an RNA transcript in B-cells that hybridizes with
mouse IgM mRNA, thus down-regulating the expression of mouse
IgM protein. Double transgenic mice containing pMAS1 and a
human heavy chain transgene minilocus such as pHC1 (generated
either by coinjection of both constructs or by breeding of
singly transgenic mice) will express the human transgene
encoded Ig receptor on a higher percentage of B-cells than mice
transgenic for the human heavy chain minilocus alone. The
ratio of human to mouse Ig receptor expressing cells is due in
part to competition between the two populations for factors
and cells that promote B-cell differentiation and expansion.
Because the Ig receptor plays a key role in B-cell
development, mouse Ig receptor expressing B-cells that express
reduced levels of IgM on their surface (due to mouse Ig
specific antisense down-regulation) during B-cell development
will not compete as well as cells that express the human
receptor.

C. An IgKappa antisense transgene construct.

The following two oligonucleotides:

5'- cgc ggt acc gct gat gct gca cca act gta tcc -3'
5'- cgc ctc gag cta aca ctc att cct gtt gaa gct -3'

are used as primers for the amplification of mouse IgKappa
constant region sequences by polymerase chain reaction (PCR)
using mouse spleen cDNA as a substrate. The resulting 0.3 kb
PCR product is digested with Asp718 and XhoI and cloned into
Asp718/XhoI digested pBCE1 to generate the antisense transgene
construct pKAS1. The purified NotI insert of pKAS1 is
microinjected into the pronuclei of half day mouse embryos,
alone or in combination with one or more other transgene
constructs, to generate transgenic mice. This construct
expresses an RNA transcript in B-cells that hybridizes with
mouse IgK mRNA, thus down-regulating the expression of mouse
IgK protein as described above for pMAS1.
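
Each of the four PCR primers above has the same layout: a three-base 5' extension, a six-base cloning site, and 24 bases that anneal to the constant region cDNA. The Python sketch below splits the primers accordingly. It assumes the sequences read as "tcc", "gct", "ctc" and "cta" where the extracted text shows "tec", "get", "etc" and "eta"; the dictionary keys and function name are hypothetical labels.

```python
SITES = {"Asp718": "GGTACC", "XhoI": "CTCGAG"}

# Primer sequences as corrected from the extracted text (an assumption).
PRIMERS = {
    "IgM forward": "cgcggtaccgagagtcagtccttcccaaatgtc",
    "IgM reverse": "cgcctcgagacagctggaatgggcacatgcaga",
    "IgKappa forward": "cgcggtaccgctgatgctgcaccaactgtatcc",
    "IgKappa reverse": "cgcctcgagctaacactcattcctgttgaagct",
}


def split_primer(primer, clamp_len=3, site_len=6):
    """Split a primer into (5' extension, enzyme name or None, annealing part)."""
    seq = primer.upper()
    site = seq[clamp_len:clamp_len + site_len]
    enzyme = next((name for name, s in SITES.items() if s == site), None)
    return seq[:clamp_len], enzyme, seq[clamp_len + site_len:]


for name, primer in PRIMERS.items():
    clamp, enzyme, annealing = split_primer(primer)
    print(f"{name}: {clamp} + {enzyme} site + {len(annealing)} nt annealing region")
```

Both forward primers carry the Asp718 site and both reverse primers the XhoI site, which is consistent with digesting the PCR products with Asp718 and XhoI before cloning into pBCE1.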

EXAMPLE 19 

This example demonstrates the successful
immunization and immune response in a transgenic mouse of the
present invention.

Immunization of Mice

Keyhole limpet hemocyanin conjugated with greater
than 400 dinitrophenyl groups per molecule (Calbiochem, La
Jolla, California) (KLH-DNP) was alum precipitated according
to a previously published method (Practical Immunology, L.
Hudson and F.C. Hay, Blackwell Scientific (Pubs.), p. 9,
1980). Four hundred μg of alum precipitated KLH-DNP along
with 100 μg dimethyldioctadecyl ammonium bromide in 100 μl of
phosphate buffered saline (PBS) was injected intraperitoneally
into each mouse. Serum samples were collected six days later
by retro-orbital sinus bleeding.

Analysis of Human Antibody Reactivity in Serum

Antibody reactivity and specificity were assessed
using an indirect enzyme-linked immunosorbent assay (ELISA).
Several target antigens were tested to analyze antibody
induction by the immunogen. Keyhole limpet hemocyanin
(Calbiochem) was used to identify reactivity against the
protein component, bovine serum albumin-DNP for reactivity
against the hapten and/or modified amino groups, and KLH-DNP
for reactivity against the total immunogen. Human antibody
binding to antigen was detected by enzyme conjugates specific
for IgM and IgG sub-classes with no cross reactivity to mouse
immunoglobulin. Briefly, PVC microtiter plates were coated
with antigen by drying overnight at 37°C of 5 μg/mL protein in
PBS. Serum samples diluted in PBS, 5% chicken serum, 0.5%
Tween-20 were incubated in the wells for 1 hour at room
temperature, followed by anti-human IgG Fc and IgG F(ab')-
horseradish peroxidase or anti-human IgM Fc-horseradish
peroxidase in the same diluent. After 1 hour at room
temperature enzyme activity was assessed by addition of ABTS
substrate (Sigma, St. Louis, Missouri) and read after 30
minutes at 415-490 nm.

Human Heavy Chain Participation in Immune Response in
Transgenic Mice

Figure 37 illustrates the response of three mouse
littermates to immunization with KLH-DNP. Mouse number 1296
carried the human IgM and IgG unrearranged transgene and was
homozygous for mouse Ig heavy chain knockout. Mouse number
1299 carried the transgene on a non-knockout background, while
mouse 1301 inherited neither of these sets of genes. Mouse
1297, another littermate, carried the human transgene and was
hemizygous with respect to mouse heavy chain knockout. It was
included as a non-immunized control.

The results demonstrate that both human IgG and IgM
responses were developed to the hapten in the context of
conjugation to protein. Human IgM also developed to the KLH
molecule, but no significant levels of human IgG were present
at this time point. In pre-immunization serum samples from
the same mice, titers of human antibodies to the same target
antigens were insignificant.

EXAMPLE 20 

This example demonstrates the successful 
immunization with a human antigen and immune response in a 
transgenic mouse of the present invention, and provides data 
demonstrating that nonrandom somatic mutation occurs in the 
variable region sequences of the human transgene. 







antigen 

Transgenic mice used for the experiment were
homozygous for functionally disrupted murine immunoglobulin
heavy chain loci produced by introduction of a transgene at
the joining (J) region (supra) resulting in the absence of
functional endogenous (murine) heavy chain production. The
transgenic mice also harbored at least one complete
unrearranged human heavy chain mini-locus transgene (HC1,
supra), which included a single functional VH gene (VH251),
human μ constant region gene, and human γ1 constant region
gene. Transgenic mice shown to express human immunoglobulin
transgene products (supra) were selected for immunization with
a human antigen to demonstrate the capacity of the transgenic
mice to make an immune response against a human antigen.
Three mice of the HC1-26 line and three mice of
the HC1-57 line (supra) were injected with human antigen.

One hundred μg of purified human carcinoembryonic
antigen (CEA) insolubilized on alum was injected in complete
Freund's adjuvant on Day 0, followed by further weekly
injections of alum-precipitated CEA in incomplete Freund's
adjuvant on Days 7, 14, 21, and 28. Serum samples were
collected by retro-orbital bleeding on each day prior to
injection of CEA. Equal volumes of serum were pooled from
each of the three mice in each group for analysis.

Titres of human μ chain-containing immunoglobulin
and human γ chain-containing immunoglobulin which bound to
human CEA immobilized on microtitre wells were determined by
ELISA assay. Results of the ELISA assays for human μ
chain-containing immunoglobulins and human γ chain-containing
immunoglobulins are shown in Figs. 38 and 39, respectively.
Significant human μ chain Ig titres were detected for both
lines by Day 7 and were observed to rise until about Day 21.
For human γ chain Ig, significant titres were delayed, being
evident first for line HC1-57 at Day 14, and later for line
HC1-26 at Day 21. Titres for human γ chain Ig continued to
show an increase over time during the course of the
experiment. The observed human μ chain Ig response, followed
by a plateau, combined with a later developing γ chain
response which continues to rise is characteristic of the
pattern seen with affinity maturation. Analysis of Day 21
samples showed lack of reactivity to an unrelated antigen,
keyhole limpet hemocyanin (KLH), indicating that the antibody
response was directed against CEA in a specific manner.

These data indicate that animals transgenic for
human unrearranged immunoglobulin gene loci: (1) can respond
to a human antigen (e.g., the human glycoprotein, CEA), (2)
can undergo isotype switching ("class switching") as
exemplified by the observed μ to γ class switch, and (3)
exhibit characteristics of affinity maturation in their
humoral immune responses. In general, these data indicate:
(1) the ability of the human Ig transgenic mice to induce
heterologous antibody production in response to a defined
antigen, (2) the capacity of a single transgene heavy chain
variable region to respond to a defined antigen, (3) response
kinetics over a time period typical of primary and secondary
response development, (4) class switching of a
transgene-encoded humoral immune response from IgM to IgG, and
(5) the capacity of a transgenic animal to produce
human-sequence antibodies against a human antigen.



Demonstration of somatic mutation in a human heavy chain
transgene minilocus.

Line HC1-57 transgenic mice, containing multiple
copies of the HC1 transgene, were bred with immunoglobulin
heavy chain deletion mice to obtain mice that contain the HC1
transgene and contain disruptions at both alleles of the
endogenous mouse heavy chain (supra). These mice express
human mu and gamma1 heavy chains together with mouse kappa and
lambda light chains (supra). One of these mice was
hyperimmunized against human carcinoembryonic antigen by
repeated intraperitoneal injections over the course of 1.5
months. This mouse was sacrificed and lymphoid cells isolated
from the spleen, inguinal and mesenteric lymph nodes, and
Peyer's patches. The cells were combined and total RNA
isolated. First strand cDNA was synthesized from the RNA and
used as a template for PCR amplification with the following 2
oligonucleotide primers:

149  5'-cta gct cga gtc caa gga gtc tgt gcc gag gtg cag ctg
(g/a/t/c)-3'

151  5'-ggc gct cga gtt cca cga cac cgt cac cgg ttc-3'

These primers specifically amplify VH251/gamma1 cDNA
sequences. The amplified sequences were digested with XhoI
and cloned into the vector pNN03. DNA sequence from the
inserts of 23 random clones is shown in Fig. 40; sequence
variations from germline sequence are indicated, and dots
indicate that the sequence is identical to germline.
Comparison of the cDNA
sequences with the germline sequence of the VH251 transgene
reveals that 3 of the clones are completely unmutated, while
the other 20 clones contain somatic mutations. One of the 3
non-mutated sequences is derived from an out-of-frame VDJ
joint. Observed somatic mutations at specific positions
occur at similar frequencies and in similar distribution
patterns to those observed in human lymphocytes (Cai et al.
(1992) J. Exp. Med. 176:1073, incorporated herein by
reference). The overall frequency of somatic mutations is
approximately 1%; however, the frequency goes up to about 5%
within CDR1, indicating selection for amino acid changes that
affect antigen binding. This demonstrates antigen driven
affinity maturation of the human heavy chain sequences.
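
The mutation frequencies quoted above are ratios of mismatched positions to positions compared, taken either over the whole variable region or over a sub-region such as CDR1. A minimal Python sketch of that calculation follows, assuming clone sequences are written in the dot notation described for Fig. 40 (a dot where the clone matches germline). The sequences in the usage lines are invented placeholders, not data from Fig. 40, and the region coordinates are hypothetical.

```python
def mutation_frequency(germline, clone, region=None):
    """Fraction of positions at which the clone differs from germline.

    clone uses '.' for identity with germline; region is an optional
    (start, end) pair of 0-based, end-exclusive coordinates.
    """
    if len(germline) != len(clone):
        raise ValueError("sequences must be aligned to the same length")
    start, end = region if region is not None else (0, len(germline))
    if end <= start:
        raise ValueError("region must contain at least one position")
    mutated = sum(
        1
        for g, c in zip(germline[start:end], clone[start:end])
        if c != "." and c.upper() != g.upper()
    )
    return mutated / (end - start)


# Placeholder sequences for illustration only.
germline = "GAGGTGCAGCTGGTGCAGTC"
clone = "......A.......T....."
print(mutation_frequency(germline, clone))           # 0.1
print(mutation_frequency(germline, clone, (5, 10)))  # 0.2
```

Applied to a set of clones, the same function can be summed over clones for each region to compare the overall rate with the rate inside a complementarity determining region.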

EXAMPLE 21

This example demonstrates the successful formation
of a transgene by co-introduction of two separate
polynucleotides which recombine to form a complete human light
chain minilocus transgene.

Generation of an unrearranged light chain minilocus transgene
by co-injection of two overlapping DNA fragments

1. Isolation of unrearranged functional Vκ gene segments
vk65.3, vk65.5, vk65.8 and vk65.15

The Vκ specific oligonucleotide, oligo-65 (5'-agg
ttc agt ggc agt ggg tct ggg aca gac ttc act ctc acc atc agc-
3'), was used to probe a human placental genomic DNA library
cloned into the phage vector λEMBL3/SP6/T7 (Clontech
Laboratories, Inc., Palo Alto, CA). DNA fragments containing
Vκ segments from positive phage clones were subcloned into
plasmid vectors. Variable gene segments from the resulting
clones were sequenced, and clones that appear functional were
selected. Criteria for judging functionality include: open
reading frames, intact splice acceptor and donor sequences,
and intact recombination sequence. DNA sequences of 4
functional Vκ gene segments (vk65.3, vk65.5, vk65.8, and
vk65.15) from 4 different plasmid clones isolated by this
procedure are shown in Figs. 41-44. The four plasmid clones,
p65.3f, p65.5g1, p65.8, and p65.15f, are described below.

(1 a) p65.3f

A 3 kb XbaI fragment of phage clone λ65.3 was
subcloned into pUC19 so that the vector derived SalI site was
proximal to the 3' end of the insert and the vector derived
BamHI site 5'. The 3 kb BamHI/SalI insert of this clone was
subcloned into pGP1f to generate p65.3f.

(1 b) p65.5g1

A 6.8 kb EcoRI fragment of phage clone λ65.5 was
subcloned into pGP1f so that the vector derived XhoI site is
proximal to the 5' end of the insert and the vector derived
SalI site 3'. The resulting plasmid is designated p65.5g1.

(1 c) p65.8

A 6.5 kb HindIII fragment of phage clone λ65.8 was
cloned into pSP72 to generate p65.8.

(1 d) p65.15f

A 10 kb EcoRI fragment of phage clone λ65.16 was
subcloned into pUC18 to generate the plasmid p65.15.3. The Vκ
gene segment within the plasmid insert was mapped to a 4.6 kb
EcoRI/HindIII subfragment, which was cloned into pGP1f. The
resulting clone, p65.15f, has unique XhoI and SalI sites
located at the respective 5' and 3' ends of the insert.



2. pKV4

The XhoI/SalI insert of p65.8 was cloned into the
XhoI site of p65.15f to generate the plasmid pKV2. The
XhoI/SalI insert of p65.5g1 was cloned into the XhoI site of
pKV2 to generate pKV3. The XhoI/SalI insert of pKV3 was
cloned into the XhoI site of p65.3f to generate the plasmid
pKV4. This plasmid contains a single 21 kb XhoI/SalI insert
that includes 4 functional Vκ gene segments. The entire
insert can also be excised with NotI.

3. pKClB 
(3 a) pKcor 

Two Xhol fragments derived from human genomic DNA 
phage X clones were subcloned into plasmid vectors. The 
first, a 13 kb J K 2-J K 5/C K containing fragment, was treated with 
Klenow enzyme and cloned into Hindlll digested, Klenow 
treated, plasmid pGPld. A plasmid clone (pK-31) was selected 
such that the 5' end of the insert is adjacent to the vector 
derived Clal site. The second Xhol fragment, a 7.4 kb piece 
of DNA containing J K 1 was cloned into XhoI/Sall-digested 
pSP72, such that the 3' insert Xhol site was destroyed by 
ligation to the vector Sail site. The resulting clone, 
p36.2s, includes an insert derived Clal site 4.5 kb upstream 
of J x l and a polylinker derived Clal site downstream in place 
of the naturally occurring Xhol site between J K 1 and J K 2. This 
clone was digested with Clal to release a 4.7 kb fragment 
which was cloned into Clal digested pK-31 in the correct 5» to 
3' orientation to generate a plasmid containing all 5 human J K 
segments, the human intronic enhancer human C^, 4.5 kb of 5' 
flanking sequence, and 9 to of 3' flanking sequence. This 
plasmid, pKcor, includes unique flanking Xhol and Sail sites 
on the respective 5' and 3* sides of the insert. 

(3b) pKcorB

A 4 kb BamHI fragment containing the human 3' kappa
enhancer (Judde, J.-G. and Max, E.E. (1992) Mol. Cell. Biol.
12: 5206, incorporated herein by reference) was cloned into
pGP1f such that the 5' end is proximal to the vector XhoI
site. The resulting plasmid, p24Bf, was cut with XhoI and the
17.7 kb XhoI/SalI fragment of pKcor cloned into it in the same
orientation as the enhancer fragment. The resulting plasmid,
pKcorB, includes unique XhoI and SalI sites at the 5' and 3'
ends of the insert respectively.

(3c) pKC1B

The XhoI/SalI insert of pKcorB was cloned into the
SalI site of p65.3f to generate the light-chain minilocus
transgene plasmid pKC1B. This plasmid includes a single
functional human Vκ segment, all 5 human Jκ segments, the human
intronic enhancer, human Cκ, and the human 3' kappa enhancer.
The entire 25 kb insert can be isolated by NotI digestion.
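
The fragment sizes quoted in this section can be cross-checked with simple arithmetic. The sketch below uses only the sizes stated in the text. The size of the pKcorB insert and the contribution of the p65.3f V segment are inferred values, not figures given in the source, and small polylinker contributions are ignored.

```python
# Fragment sizes in kb, as stated in the text.
j2_to_c_fragment = 13.0   # J-kappa 2 to J-kappa 5 / C-kappa fragment in pK-31
cla_fragment = 4.7        # ClaI fragment carrying J-kappa 1
enhancer_fragment = 4.0   # BamHI fragment with the 3' kappa enhancer
pkc1b_insert = 25.0       # NotI insert of pKC1B

# Inferred sizes (assumptions, rounded to 0.1 kb).
pkcor_insert = round(j2_to_c_fragment + cla_fragment, 1)
pkcorb_insert = round(pkcor_insert + enhancer_fragment, 1)
v_segment_part = round(pkc1b_insert - pkcorb_insert, 1)

print("pKcor XhoI/SalI fragment:", pkcor_insert, "kb")
print("pKcorB insert (inferred):", pkcorb_insert, "kb")
print("p65.3f contribution (inferred):", v_segment_part, "kb")
```

The first sum reproduces the 17.7 kb XhoI/SalI fragment of pKcor quoted above. The remaining difference of roughly 3 kb is what the p65.3f segment would have to contribute for the pKC1B insert to total about 25 kb.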






4. Co4 

The two NotI inserts from plasmids pKV4 and pKC1B
were mixed at a concentration of 2.5 µg/ml each in
microinjection buffer, and co-injected into the pronuclei of
half day mouse embryos as described in previous examples.
Resulting transgenic animals contain transgene inserts
(designated co4, product of the recombination shown in Fig.
45) in which the two fragments co-integrated. The 3' 3 kb of
the pKV4 insert and the 5' 3 kb of the pKC1B insert are
identical. Some of the integration events will represent
homologous recombinations between the two fragments over the 3
kb of shared sequence. The co4 locus will direct the
expression of a repertoire of human sequence light chains in a
transgenic mouse.
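
The outcome of a homologous recombination over the shared sequence can be pictured as joining the two inserts across their common end. The following is a minimal sketch with short toy strings standing in for the inserts. The sequences, the helper name `merge_on_overlap`, and the size estimate are illustrative assumptions, and the estimate treats the pKV4 NotI insert as roughly equal to its 21 kb XhoI/SalI insert.

```python
def merge_on_overlap(upstream: str, downstream: str, min_overlap: int = 1) -> str:
    """Join two sequences across the longest suffix/prefix they share."""
    longest = min(len(upstream), len(downstream))
    for size in range(longest, min_overlap - 1, -1):
        if upstream.endswith(downstream[:size]):
            return upstream + downstream[size:]
    raise ValueError("no shared sequence of the required length")


# Toy stand-ins: the shared block plays the role of the 3 kb overlap.
shared = "GATTACAGATTACA"
pkv4_insert = "AAAACCCCAAAACCCC" + shared
pkc1b_insert = shared + "TTTTGGGGTTTTGGGG"

co4 = merge_on_overlap(pkv4_insert, pkc1b_insert, min_overlap=len(shared))
print(co4)

# Back-of-envelope size of the recombined locus, in kb.
estimated_kb = 21 + 25 - 3
print("estimated co4 size:", estimated_kb, "kb")
```

Under these assumptions the shared block appears once in the merged product, and a single recombined copy would span on the order of 43 kb. Actual integrants may contain multiple copies or non-homologous junctions.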

The foregoing description of the preferred
embodiments of the present invention has been presented for
purposes of illustration and description. It is not
intended to be exhaustive or to limit the invention to the
precise form disclosed, and many modifications and variations
are possible in light of the above teaching.

All publications and patent applications herein are
incorporated by reference to the same extent as if each
individual publication or patent application was specifically
and individually indicated to be incorporated by reference.

Although the present invention has been described in 
some detail by way of illustration for purposes of clarity of 
understanding, it will be apparent that certain changes and 
modifications may be practiced within the scope of the claims. 




WHAT IS CLAIMED IS: 

TRANSGENE CLAIMS 
1. An isolated immunoglobulin heavy chain
transgene that is expressed in B cells of a transgenic
nonhuman animal containing at least one integrated copy of a
polynucleotide comprising a DNA sequence of the formula:

$(V_H)_x-(D)_y-(J_H)_z-(S_D)_m-(C_1)_n-[(T)-(S_A)_p-(C_2)]_q$

wherein x, y, z, m, n, p, and q are integers and x is 2-100,
n is 1-10, y is 2-50, p is 1-10, z is 1-50, q is 0-50, and m
is 0-10.

2. A transgene of Claim 1, wherein said
polynucleotide comprises at least one heterologous D gene
segment that can be incorporated into a functionally
rearranged V-D-J sequence.

3. A transgene of Claim 2, wherein said
heterologous D gene segment contains at least one human D
gene.

4. A transgene of Claim 1, wherein said
polynucleotide comprises a human µ CH gene segment and a human
γ1 CH gene segment.

5. A transgene of Claim 1, wherein q is at least
1, m is at least 1, n is at least 1, and said polynucleotide
comprises at least about 50 basepairs of a segment immediately
upstream of a germline switch sequence.

6. A transgene of Claim 5, wherein said
polynucleotide comprises about 200 basepairs of sequence
immediately upstream of a human germline γ1 switch sequence.

7. A transgene of Claim 1, wherein an SD segment is
a γ1 switch sequence.

8. A transgene of Claim 1, wherein said
polynucleotide comprises about 200 basepairs naturally
upstream of a human germline γ1 switch sequence, and wherein
said 200 basepairs are operably linked to a human γ1 switch
sequence.

9. A transgene of Claim 1, wherein said D region 
comprises only heterologous D genes. 

10. A transgene of Claim 9, wherein said D region 
comprises only human D genes. 

11. A transgene of Claim 1, wherein the 
polynucleotide is functionally rearranged in vivo to produce a 
rearranged V-D-J gene segment that contains a recognizable D 
region gene sequence. 

12. A transgenic non-human animal comprising a 
transgene of Claim 1 in the germline of said non-human animal. 

13. A transgenic non-human animal of Claim 12,
wherein said transgene is rearranged.

14. A transgenic non-human animal of Claim 12, 
wherein said transgene is unrearranged. 

15. A transgenic non-human animal of Claim 14, 
wherein said B cells produce a heterologous antibody. 

16. A transgenic non-human animal of Claim 15, 
wherein said B cells produce a population of heterologous 
antibodies of more than one isotype. 

17. A transgenic non-human animal of Claim 12,
wherein said transgene encodes VH, D, JH, and CH regions.

18. A transgenic non-human animal of Claim 12,
wherein said non-human animal is a rodent.

19. A transgenic non-human animal of Claim 12,
wherein serum of said animal comprises human antibodies that
contain a recognizable D region gene sequence.

20. A transgenic non-human animal of Claim 12,
wherein at least one lymphocyte of said animal contains an mRNA
encoding a heterologous immunoglobulin chain.

21. A transgenic animal of claim 20, wherein said
mRNA contains a recognizable D region gene sequence.

22. A transgenic animal of claim 21, wherein said
mRNA contains a functionally rearranged V-D-J sequence.

23. A transgenic animal of claim 12, wherein said
transgenic animal comprises heterologous antibodies which
comprise a human-sequence µ chain and which specifically bind
to an antigen.

24. A transgenic animal of claim 23, further
comprising heterologous antibodies which comprise a human-
sequence γ chain and which specifically bind to an antigen.

25. A transgenic animal of claim 24, wherein the
antigen is a human antigen.

26. A transgenic animal of claim 25, wherein the 
human antigen is CEA or a human blood cell antigen. 

27. A transgenic nonhuman animal comprising a human
transgene of claim 1, wherein this transgenic animal further
comprises lymphoid tissue containing a population of mRNA
species having somatic mutations clustered in CDR regions of a
variable region encoded by an immunoglobulin transgene.

28. A transgenic nonhuman animal of claim 27,
wherein the immunoglobulin transgene is a heavy chain
minilocus corresponding to HC1.

29. A hybridoma comprising a transgenic non-human B
cell fused with a second cell capable of immortalizing said B
cell, wherein said hybridoma produces a monoclonal antibody
heterologous to said non-human animal.

30. A hybridoma of claim 29 wherein the
heterologous antibody comprises a human heavy chain containing
a recognizable D region gene sequence.

31. A hybridoma of Claim 29 wherein said B cell is
of murine origin.

32. A hybridoma of Claim 29 wherein the monoclonal
antibody binds to a human antigen with an affinity of at least
$1 \times 10^{7}$ M$^{-1}$.

33. A human monoclonal antibody produced by a 

hybridoma of Claim 29. 

34. A transgenic non-human animal having serum
comprising detectable heterologous antibodies and having at
least one suppressed endogenous immunoglobulin locus.

35. A method for producing heterologous
immunoglobulins from a transgenic nonhuman animal, the animal
having a genome comprising germline copies of at least one
transgene of Claim 1, the method comprising:

suppressing an endogenous immunoglobulin locus;
contacting the animal with a preselected antigen; and
collecting said heterologous immunoglobulins.

36. A method according to Claim 35, wherein 
suppression is produced by an antisense polynucleotide. 

37. A method according to Claim 36, wherein the
antisense polynucleotide is transcribed from an integrated
antisense transgene.

38. A method according to Claim 37, wherein
transcription of the antisense transgene produces a transcript
containing an antisense sequence linked to a second sequence.

39. A method according to Claim 37, wherein the
non-human animal has a genome comprising germline copies of at
least one light chain immunoglobulin transgene.

40. A method for suppressing at least one
endogenous immunoglobulin locus in a transgenic non-human
animal, comprising the steps of:

introducing an antisense transgene into a non-human
animal to produce a non-human transgenic animal bearing an
antisense transgene;

transcribing antisense RNA from the antisense
transgene in vivo;

hybridizing the antisense RNA to a polynucleotide
containing an endogenous immunoglobulin sequence; and

inhibiting expression of an endogenous
immunoglobulin chain.

41. A method of Claim 40, wherein said antisense
transgene contains a nucleotide sequence that is homologous to
an endogenous kappa chain immunoglobulin gene sequence.

42. A method of Claim 40, wherein said antisense
transgene contains a nucleotide sequence that is homologous to
an endogenous heavy chain immunoglobulin gene sequence.

43. A method for inactivating an endogenous
immunoglobulin gene, comprising the steps of:

integrating a targeting vector into an endogenous
immunoglobulin gene; and

selecting for a cell bearing an integrated targeting
vector.

44. A method according to Claim 43, further
comprising the step of:

generating a line of non-human animals bearing a
copy of the integrated targeting vector.

45. A method according to Claim 43, wherein the 
endogenous immunoglobulin gene is a light chain gene. 

46. A method according to Claim 45, wherein the 
light chain gene is a kappa light chain gene. 

47. A method according to Claim 43, wherein the 
endogenous immunoglobulin gene is a heavy chain gene. 

48. A method according to Claim 43, wherein the 
endogenous immunoglobulin gene is a murine immunoglobulin 
gene. 

49. A method according to Claim 44, wherein the 
non-human animal is a mouse. 

50. A method according to Claim 49, wherein the 
mouse further comprises a human immunoglobulin transgene of 
Claim 1. 

51. A method according to claim 43, wherein the 
cell is an embryonic stem cell. 

52. A method of generating a non-human animal
having an inactivated endogenous immunoglobulin gene,
comprising the steps of:

breeding a line of non-human animals generated by
the method of Claim 44; and

identifying individual non-human animal offspring
that are homozygous for an inactivated immunoglobulin gene.

53. A method according to Claim 52, wherein said
offspring have an inactivated heavy chain immunoglobulin gene
and at least one inactivated light chain immunoglobulin gene.

54. A transgenic non-human animal with an
inactivated endogenous immunoglobulin gene.

55. A non-human animal of Claim 54, wherein an
integrated targeting vector is present in germline DNA.

56. A non-human animal of Claim 55, wherein a light
chain immunoglobulin gene is inactivated.

57. A non-human animal of Claim 55, wherein a heavy
chain immunoglobulin gene is inactivated.

58. A non-human animal of Claim 55 which is
homozygous for at least one inactivated endogenous
immunoglobulin gene.

59. An antisense transgene comprising a nucleotide
sequence that is complementary to a polynucleotide sequence
that is substantially identical to an immunoglobulin gene
sequence.

60. A transgenic non-human animal bearing an
antisense transgene of Claim 59.

61. A transgenic non-human animal comprising a
functionally disrupted endogenous heavy chain locus and a
heterologous immunoglobulin heavy chain transgene, wherein
said animal makes an antibody response following immunization
with an antigen.

62. A transgenic non-human animal of Claim 61,
wherein said functionally disrupted endogenous heavy chain
locus is a JH region homologous recombination knockout, said
heterologous immunoglobulin heavy chain transgene is the HC1
human minigene transgene, and said antigen is a human antigen.

63. A transgenic non-human animal of Claim 61,
wherein the antibody response comprises a population of
antibodies which comprise human µ chain-containing
immunoglobulins and human γ chain-containing immunoglobulins.

64. A transgenic non-human animal of Claim 63,
wherein the heterologous antibodies comprise a population of
heterologous immunoglobulins which comprise somatic mutation
in the variable regions which cluster in the CDRs.

65. A transgenic non-human animal of Claim 63,
wherein the antigen is selected from the group consisting of
human blood cell surface antigens, KLH, and human CEA.
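
The integer limits recited for the formula of Claim 1 can be checked mechanically for any candidate construct. The following is a minimal sketch. The dictionary layout, the function name, and the example counts are illustrative assumptions, not taken from the specification. The range for m is taken here as 0-10, an assumption consistent with Claim 5 separately requiring that m be at least 1.

```python
# Inclusive (low, high) limits for each subscript of the Claim 1 formula.
# The lower bound for m is an assumption; see the note above.
CLAIM1_RANGES = {
    "x": (2, 100),  # VH gene segments
    "y": (2, 50),   # D gene segments
    "z": (1, 50),   # JH gene segments
    "m": (0, 10),   # SD (donor switch) segments
    "n": (1, 10),   # C1 constant region segments
    "p": (1, 10),   # SA (acceptor switch) segments
    "q": (0, 50),   # repeats of the bracketed T-SA-C2 unit
}


def out_of_range(counts: dict[str, int]) -> list[str]:
    """Return the subscripts that are missing or outside the recited limits."""
    problems = []
    for name, (low, high) in CLAIM1_RANGES.items():
        value = counts.get(name)
        if value is None or not low <= value <= high:
            problems.append(name)
    return problems


# Hypothetical construct, for illustration only.
example = {"x": 2, "y": 10, "z": 6, "m": 1, "n": 1, "p": 1, "q": 1}
print(out_of_range(example))
print(out_of_range({**example, "x": 1}))
```

A construct passing this check satisfies only the numerical limits of the formula. The other limitations of the claims, such as expression in B cells, are outside the scope of the sketch.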
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TTTTCTGQCC TGACAAOCAG GGTGGCGCAG GATGCTCAGT GCAGAGAGGA 50 

AGAAGCAGGT GGTCTCTGCA GCTGGAAGCr CAGCTCCCAC CCAGCTGCTT 100 

TOCATCTOCC TCOTGCTGC CCTAOCTTCC AGAQCXX^JII^aJjOCTG 150 

TCTCAGAGCC CIX3GGGAGGA ACTGCTCAGT TAGGAOOCAG AQGGAAGCAT 200 

Ms 

GGAAGCCCCA GCrcaQCTTC TCTTOCTOCT GCTACTCTGG CTCCCAGgtg 250 
tGluAlaPro AlaGlnLeuL euPheLeuLe uLeuLeuTrp LeuPrb 
agggggaacc atgaggtggt tttgcacatt agtgaaaact cttgccacct 300 

ctgctcagca agaaatataa ttaaaattca aagtatatca acaattttgg 350 

ctctactcaa agacagttgg tttgatcttg attacatgag tgcatttctg 400 

ttttatttcc aatttcagAT AOCAOCGGAG AAATTGTGTT GACACAGTCT 450 

Asp ThrThrGlyG luIleValLe uThrGlnSer 
CEAGCCAOCC TGTCTTTGTC TCCAGGGGAA AGAG0CAO0C TCTCCTGCAG 500 
ProAlaThrL euS erLe uSe rProGlyGlu ArgAlaThrL euSerCysAr 
QGCCAGTCAG AGTG1TAGCA. GCTACTTAGC CTGGTACCAA CAGAAACCTG 550 
gAlaSerGln SerValSerS erTyrLeuAl aTrpTyrGln GlnLysProG 
G0CAGGC1C C eaGGCTOCTC ATCTATGATG CATCCAACAG GGOCACTGGC 600 
lyGlnAlaPr oArgLeuLeu IleTyrAspA laSerAsnAr gAlaThrGly 
ATOOCAGOCA GG3TCAGTGG CAGTQGGTCT QGGACAGACT TCACTCTCAC 650 
IleProAlaA rgPheSerGl ySerGlySer GlyThrAspP heThrL^uTh 
CATCAGCAGC CIAGAGCCTG AAGATTTTGC AGTTTATIAC TGTCAGCAGC 700 
rlleSerSer LeuGluProG luAspPheAl aVal TvrTvr CysGl nGlnA 
GIAGCAACTG GOCTOrt^rft (TfTftTTnTRT ATO^CSi^MItoACA 750 

rgSerAsnlr pPro 

AGAOCATCAG TGTTTACTAG ATTAriATAC CAGCTGCTTC CTTIACAGAC 800 

AGCTAGTQQG GT 812 
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AGGGCGGCGC AGATGCTCAG TGCAGAGAGA AGAAACAGGT GGICTCTGCA 50 

GCTGGAAGCT CAQCTCCTAC OCX2GCTGCT TTQCATCTOC CTCCCAGCTG 100 

CCCIAOCITC CAGAQCaE^Bxcr GGGTC30GC TCTEGGGAGG 150 

AACIGCTCAG TTAGGAQOCA GACX3GAACCA TGGAAGCOCC AGCGCAGCIT 200 

M etGluAlaPr oAlaGlnLeu 

CTCTTCCTCC TGCIACTCTG GCTCACAGgt gaggggaata tgaggtgtct 250 
LeuPheLeuL euLeuLeuTr pLeuThr 

ttgcacatca gtgaaaactc ctgccacctc tgctcagcaa gaaatataat 300 

taaaattcaa aatagatcaa caattttggc tctactcaaa gacagtgggt 350 

ttgattttga ttacatgagt gcatttctgt tttatttcca atttcagAlA 400 

- " " AspT 

CEACOGGAGA AATJGTGTTG ACACAGTCTC CAGCCAOCCT GTCTTTGTCT 450 
hrThrGlyGl uIleValLeu ThrGlnSerP roAlaThrte uSerLeuSer 
CCAGGGGAAA GAQXAOXT OTCTGCAGG GOCAGTCAGG GTGTTAGCAG 500 
ProGlyGluA rgAlaThrLe uSerCysArg AlaSerGlnG lyValSerSe 

ozotagoc tggtaccagc agaaacctgg ccaggctccc aggctcctca 550 

rlyrLeuAla TrpTyrGlnG InLysProGI yGlnAlaPro ArgLeuIeuI 
TCIATCA3GC ATGCAACAGG GOCACTGGCA TCOCAGCCAG GTTCAGTGGC 600 
leTyiAspAl aSerAsnArg AlaThrGlyl leProAlaAr gPheSerGly 
AGTGGGCCTG GGACAGACTT QOCTCACC AICAGCAGCC TAGAGCCTGA 650 
SerGlyProG lyThrAspPh eThrLeuThr IleSerSerL euGluProGl 
AGAITTTGCA GTTIATIACT GTCAGCAGCG TAGCAACTGG CATCCCACAQ 
uAspPheAla ValTvrTyrC ysGlnGlnAr gSerAsnTrp His 
iTTjaWrAHA T^&j&rft&RA AfTt rAACAA GACCATCAGT GTTTA 

TTATIAIACC AGCTGCTTCC TTTACAGACA GCTAGTGGGG TGGCCACTCA 800 

GIGTTAGCAT CICAGCTCIA TTTGGCCATT TTGGAGTTCA AGTTGTCAAG 850 

TOCAAAATTA CrTATGTTAG TOCATTGCAT CATACEAITT CAGTGTGGCT 900 
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CCQCXrCAQC TGCTTTGCAT GTCCCTCCCA GOCGCCCTGC AGTCCAGAGC 50 

rr^TATTAATl GCCTGGGTCA. GAGCTCTGSA GAAGAGCTGC TCAGTIAGGA 100 

ACOCCAGAGG GAACCATCGA AACCCCAGCG CAGCTTCTCT TCCTCCTGCT 150 

MstGl ulhrProAla GlnLeuLeuP helaul£suD5 

ACTCTGGCTC CCAGgtgagg ggaacatggg atggttttgc atgtcagtga 200 
uLeuTrpLeu Pro 

aaaccctctc aagtcctgtt acctggcaac tctgctcagt caatacaata 250 

attaaagctc aatataaagc aataattctg gctcttctgg gaagacaatg 300 

ggtttgattt agattacatg ggtgactttt ctgttttatt tccaatctca 350 

gAIAOCAOOG GAGAAATTGT GTCGACGCAG TCTCCAGGCA CCCTGTCnT 400 
AspThrThrG lyGluIleVa ILeuThrGln SerProGlyT hrl^uSerle 

GTCTCCAGGG GAAAGAGOCA CCCICTOCIG CAQGGCCAGT CAGAGIGTTA 450 
uSerProGly GluArgAlaT hrLeuSerCy sArgAlaSer GlnSerValS 

GC2GCAGCIA. CTTAGCXnGG TACCAGCAGA AAGCIGGOCA GGCTCOCAGG 500 
erSerSerTy rLeuAlaTrp TyrGlnGlnL ysProGlyGl nAlaProArg 

CTOCTCATCT ATGGTGCATC CAQCAGGGCC ACTGGCATOC CAGACAGGTT 550 
LeuLeulleT yrGlyAlaSe rSerArgAla ThrGlylleP roAspArgPh 

CAG1GGCAGT GGGTCTGGGA OOOTCAC TCTCACCATC AGCAGACTQG 600 
eSerGlySer GlySerGlyT hrAspPheTh rLeuThrlle SerArgLeuG 

AQXTGAAGA TTTTQCAGTG TATTACTGTC AGCAGTATGG TAGCTCACCT 650 
l uProGluA s pPheAlaVal TyrTvrCvsG luGlnTyrGl ySerSerPro 

rrhArArTTTlA TTCAGCTTGA fiSH TCTGCAAGAC CITCATTGTT 700 

TACTAGATIA TACCAGCTGC TTCETTTACA GATAGCIGCT GCAATGACM 750 

CTCAATTTAG CATCTCICTC TQCTIGQGCA TTTTGGGGAT CTTAAAAAAG 800 

TAAIGCCTTG A3IATATTTTT GACTCTGATT CCTGCAITrr TOCTCAGAGC 850 

AAGATGGACA GOCAGGTTTA AQGACAGTIT CACAGTAATG GOCACTGGAT 900 
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MACACA3TC TCTGCAGACA. AATTTGAGCT ACCTTGATCT TACCTGGACA 50 

GCTGGGGACA CTGAGCTQGT GCTGAGTTAC TCAGATGCGC CAGCTCTGCA 100 

GCTGTCCCCA GCCTGCCCCA TCCCCTGCTC ATTTGCATGT TCCCAGAGCA 150 

CAACCTCCIG CCC1GAAGCC iTTATTAATfo G GCTGGTCAGA CTTTGTGCAG 200 

GAATCAGACC CAGTCAGGAC JOGCEIQ^ CfclGAGGG^ 250 

M=tAs pMetArgVal LeuAlaGlnL 

TCCTGGGGCT OCTGCTGCTC TGTTTCCCAG gtaaggatgg agaacactag 300 
euLeuGlyLe uLeuLeuLeu CysPhePro 

cagtttactc agcccagggt gctcagtact gctttactat tcagggaaat 350 

tctcttacaa catgattaat tgtgtggaca tttgttttta tgtttccaat 400 

CtcagGTGX AGAIGTGACA TOCAGATGAC CCAGTCTCCA TCCTCACTGT 450 

GlyAla ArgCysAspI leGlitffetTh rGlnSerPro SerSe rbeuS 
C'XGC ATC TGI AGGAGACAGA GTCACCATCA CTTGTCGGGC GAGTCAGGGT 500 
erAlaSerVa lGlyAspArg ValThrlleT hrCysArgAl aSerGlnGly 
ATTAGCAGCT GGTIAGOCTG GTATCAGCAG AAACCAGAGA AAGCCCCTAA 550 
IleSerSerT rpLeuAlaTr pdyrGlnGln LysProGluL ysAlaProLy 
GTOCCTCATC TATGCTGCAT CCAGTTTGCA AAGTGGGGTC CCATCAAGGT 600 
sSerLeuIle TyrAlaAlaS erSerLeuGl nSerGlyVal ProSerArgP 
TCAGCGGCAG TGGATCTGGG ACAGATTTCA CTCTCAOCAT CAGCAGCCTG 650 
heSerGlySe rGlySerGly ThrAspPheT hrLeuThrll eSerSerleu 
CAGCCTGAAG ATTTTQCAAC TTAITACTGC CAACAGTAIA ATAGTTACCC 700 
GlnProGluA spPheAlaTh r TyrTvrCys Gl nGlnTyrA snSerTyrPr 
arrH\rAf?TTl TTACACACOC AAESaS3xCAGGGAA GCAGATGTGT 750 
.0 

GAGGCTGGQC TGOQGCAGCT GCTTCTOCTC ATGOCTCCAT CAGCTGAGAG 800 
TCTTQCTCAG ATGCAGCCAC ACTCTGATGG TGTTGGTAGA TGGGGAC 847 
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